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CHAPTER 1 


Vector Spaces 


Suppose a and b are real numbers, not both 0. Find real numbers c and d 
such that 


1/(a+ bi) = c + di. 


SOLUTION: Multiplying both the numerator and the denominator of the 
left side of the equation above by a — bi gives 


a—bi f 
oip erd 
Thus we must have 
a —b 
= a= =. 
© +h ane a? +b?’ 


because a and b are not both 0, we are not dividing by 0. 


COMMENT: Note that these formulas for c and d are derived under the 
assumption that a + bi has a multiplicative inverse. However, we can forget 
about the derivation and verify (using the definition of complex multiplica- 
tion) that 


\(__@ b o AL 
(0+ bi)( soe ari) oh 


which shows that every nonzero complex number does indeed have a multi- 
plicative inverse. 
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2. Show that 


-14+ V3i 
2 


is a cube root of 1 (meaning that its cube equals 1). 
SOLUTION: Using the definition of complex multiplication, we have 
= + a _zl-v3i 
2 2 i 
Thus 


("y= AG ye) 


3. Prove that —(—v) = v for every ve V. 
SOLUTION: Let v €V. By the definition of additive inverse, we have 
v +(—v) = 


The additive inverse of —v, which by definition is —(—v), is the unique vector 
that when added to —v gives 0. The equation above shows that v has this 
property. Thus —(—v) = v. 


COMMENT: Using 1.6 twice leads to another proof that —( —v) =v. How- 
ever, the proof given above uses only the additive structure of V, whereas a 
proof using 1.6 also uses the multiplicative structure. 


4, Prove that if a € F, v € V, and av = 0, then a = 0 or v = 0. 
SOLUTION: Suppose that a € F, v EV, and 
av = 0. 


We want to prove that a = 0 or v = 0. If a = 0, then we are done. So 
suppose that a # 0. Multiplying both sides of the equation above by 1/a 
gives 


* (av) => 
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The associative property shows that the left side of the equation above equals 
lv, which equals v. The right side of the equation above equals 0 (by 1.5). 
Thus v = 0, completing the proof. 


5. For each of the following subsets of F3, determine whether it is a subspace 
of F°: 


(a) {(r1, 22,23) € F? : z1 + 2x2 + 323 = O}; 
(b) {(x1, 22,23) E€ F? : z1 + 2x2 + 323 = 4}; 
(c) {(r1, 22,23) € F? : zyx2r3 = 0}; 
(da) {(r1, 22,23) € F? : x = 523}. 
SOLUTION: (a) Let 
U = {(21, £2, £3) € F? : z1 + 229 + 323 = 0}. 


To show that U is a subspace of F°, first note that (0,0,0) € U, so U is 
nonempty. 
Next, suppose that (£1, £2,273) € U and (y1, y2,y3) € U. Then 


Tı + 272 + 323 = 0 
yi + 2yo + 3y3 = 0. 


Adding these equations, we have 
(z1 + y1) + 2(xe2 + yo) + 3(z3 + y3) = 0, 
which means that (x; + y1, £2 + yo,73 + y3) € U. Thus U is closed under 
addition. 
Next, suppose that (£1, 22,23) € U anda € F. Then 
£1 + 2x2 + 323 = 0. 
Multiplying this equation by a, we have 
(azı) + 2(az2) + 3(ax3) = 0, 


which means that (az1,az2,ar3) € U. Thus U is closed under scalar multi- 
plication. 

Because U is a nonempty subset of F3 that is closed under addition and 
scalar multiplication, U is a subspace of F3. 
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(b) Let 
U = {(x1, 22,23) € F3 : x) + 229 + 323 = 4}. 


Then (4,0,0) € U but 0(4,0,0), which equals (0,0,0), is not in U. Thus U 
is not closed under scalar multiplication. Thus U is not a subspace of F3. 
(c) Let 


U= {(r1, 22, z3) € F°: T1T2T3 = 0}. 


Then (1, 1,0) € U and (0,0,1) € U, but the sum of these two vectors, which 
equals (1, 1, 1), is not in U. Thus U is not closed under addition. Thus U is 
not a subspace of F3, 

(d) Let 


U = {(x1, 22,23) E€ F? : T1 = 5x3}. 


To show that U is a subspace of F3, first note that (0,0,0) € U, so U is 
nonempty. 
Next, suppose that (z1, 22,23) € U and (y1, y2,y3) € U. Then 


Tı = 523 
Yi = 5y3- 
Adding these equations, we have 
T1 + yi = 5(z3 + ys), 


which means that (£1 + yi, 22 + y2,z3 + y3) € U. Thus U is closed under 
addition. 
Next, suppose that (21, 22,73) EU anda € F. Then 


r= 5x3. 
Multiplying this equation by a, we have 
az) = 5(az3), 


which means that (azı, az2,ax3) € U. Thus U is closed under scalar multi- 
plication. 

Because U is a nonempty subset of F3 that is closed under addition and 
scalar multiplication, U is a subspace of F3. 
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6. Give an example of a nonempty subset U of R? such that U is closed under 
addition and under taking additive inverses (meaning —u € U whenever 
u € U), but U is not a subspace of R?. 


SOLUTION: Let U = {(m,n) : m and n are integers}. Then clearly 
U is closed under addition and under taking additive inverses. However, 
(1,1) € U but $(1,1), which equals (3,4), is not in U, so U is not closed 
under scalar multiplication. Thus U is not a subspace of R?. 

Of course there are also many other examples. 


7. Give an example of a nonempty subset U of R? such that U is closed under 
scalar multiplication, but U is not a subspace of R?. 


SOLUTION: Let U be the union of the two coordinate axes in R2. More 
precisely, let 


U = {(z,0): cE R}U{(0,y): ye R}. 


Then clearly U is closed under scalar multiplication. However, (1,0) and 
(0,1) are in U but their sum, which equals (1,1) is not in U, so U is not 
closed under addition. Thus U is not a subspace of R?. 

Of course there are also many other examples. 


8. Prove that the intersection of any collection of subspaces of V is a subspace 
of V. 


SOLUTION: Suppose {Ua}acr is a collection of subspaces of V; here T 
is an arbitrary index set. We need to prove that [ser Ua, which equals the 
set of vectors that are in U, for every a €T, is a subspace of V. 

The additive identity 0 is in Ua for every a € T (because each Ua is a 
subspace of V). Thus 0 € (),¢p Ua. In particular, Maer Va is a nonempty 
subset of V. 

Suppose u,v € [laerUa. Then u,v. E€ Ua for every a € I. Thus 
u +v EUa for every a €T (because each Ua is a subspace of V). Thus 
u +v € flaer Va- Thus flaer Va is closed under addition. 

Suppose u € flaer Ua and a € F. Then u € Ua for every a € T. Thus 
au € Ua for every a € I (because each Ua is a subspace of V). Thus 
au € faer Ua. Thus fzer Ua is closed under scalar multiplication. 

Because (laer Ua is a nonempty subset of V that is closed under addition 
and scalar multiplication, (),<p Ua is a subspace of V. 


COMMENT: For many students, the hardest part of this exercise is un- 
derstanding the meaning of an arbitrary intersection of sets. Instructors who 
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10. 


11. 


do not want to deal with this issue should change the exercise to “Prove that 
the intersection of any finite collection of subspaces of V is a subspace of V.” 
Many students will then prove that the intersection of two subspaces of V 
is a subspace of V and use induction to get the result for finite collections 
of subspaces. 


Prove that the union of two subspaces of V is.a subspace of V if and only if 
one of the subspaces is contained in the other. 


SOLUTION: Suppose U and W are subspaces of V such that UUW is a 
subspace of V. We will use proof by contradiction to show that U C W or 
W CU. Suppose that our desired result is false. Then U ¢ W and W ¢ U. 
This means that there exists u € U such that u ¢ W and there exists w € W 
such that w ¢ U. Because u and w are both in U U W, which is a subspace 
of V, we can conclude that u+ w E UUW. Thusu+weU orutweW. 

First consider the possibility that u+w € U. In this case w, which equals 
(u + w) + (—u), would be in the sum of two elements of U and hence we 
would have w € U, contradicting our assumption that w ¢ U. 

Now consider the possibility that u++w € W. In this case u, which equals 
(u + w) + (—w), would be in the sum of two elements of W and hence we 
would have u € W, contradicting our assumption that u ¢ W. 

The two paragraphs above show that u + w ¢ U and u +w ¢ W, con- 
tradicting the final sentence of the first paragraph of this solution. This 
contradiction completes our proof that U CW or W CU. 

The other direction of this exercise is trivial: if we have two subspaces 
of V, one of which is contained in the other, then the union of these two 
subspaces equals the larger of them, which is a subspace of V. 


Suppose that U is a subspace of V. What is U +U? 


SOLUTION: By definition, U +U = {u+v : u,v € U}. Clearly U CU+U 
because if u € U, then u equals u + 0, which expresses u as a sum of two 
elements of U. Conversely, U +U C U because the sum of two elements of U 
is an element of U (because U is a subspace of V). Conclusion: U +U =U. 


Is the operation of addition on the subspaces of V commutative? Associa- 
tive? (In other words, if U1, U2, U3 are subspaces of V, is U1 + Ug = U2 +U,? 
Is (U1 + U2) + Uz = U; + (U2 + U3)?) 


SOLUTION: Suppose U1, U2, U3 are subspaces of V. 
A typical element of U; + U2 is a vector of the form u; +u2, where u; € U; 
and u2 € U2. Because addition of vectors is commutative, u; + ug equals 
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12. 


13. 


14. 


u2 +1, which is a typical element of U2 + U1. Thus Ui + Ug = U2 + U1. In 
other words, the operation of addition on the subspaces of V is commutative. 

A typical element of (U1 + U2) +U3 is a vector of the form (uj +u2) +u3, 
where u; € Uj, u2 € Us, and u3 € U3. Because addition of vectors is 
associative, (uy + u2) + ug equals ui + (u2 + u3), which is a typical element 
of U, + (U2 + U3). Thus (U1 + U2) + U3 = Uy + (U2 + U3). In other words, 
the operation of addition on the subspaces of V is associative. 


Does the operation of addition on the subspaces of V have an additive 
identity? Which subspaces have additive inverses? 


SOLUTION: The subspace {0} is an additive identity for the operation 
of addition on the subspaces of V. More precisely, if U is a subspace of V, 
then U + {0} = {0} +U =U. 

For a subspace U of V to have an additive inverse, there would have to 
be another subspace W of V such that U + W = {0}. Because both U and 
W are contained in U + W, this is possible only if U = W = {0}. Thus {0} 
is the only subspace of V that has an additive inverse. 


Prove or give a counterexample: if Ui, U2, W are subspaces of V such that 
UL+W=U +W, 
then U; = U2. 


SOLUTION: To construct a counterexample for the assertion above, 
choose V to be any nonzero vector space. Let U) = {0}, U2 = V, and 
W =V. Then U, +W and U2 + W are both equal to V, but U, #4 U2. 

Of course there are also many other examples. 


Suppose U is the subspace of P(F) consisting of all polynomials p of the 
form 


p(z) = az? + b25, 
where a,b € F. Find a subspace W of P(F) such that P(F) = U @W. 


SOLUTION: Let W be the set of all polynomials (with coefficients in F) 
whose z?-coefficient and 25-coefficient both equal 0. Then every polynomial 
in P(F) can be written uniquely in the form p + q, where p€ U andq E W. 
Thus P(F) = U ẹ W. 
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15. 


COMMENT: There are other possible choices for W that give a correct 
solution to this exercise, but the choice for W made above is certainly the 
most natural one. 


Prove or give a counterexample: if U}, U2, W are subspaces of V such that 
V=U,0W and V =U W, 
then U; = Up. 


SOLUTION: To construct a counterexample for the assertion above, let 
V = F?, let U, = {(z,0) : z € F}, let U2 = {(0,y) : y € F}, and let 
W = {(z,z):2€ F}. Then 


F?=U,@W and F?=U2.0W, 


as is easy to verify, but U, # Us. 
Of course there are also many other examples. 


CHAPTER 2 


Finite-Dimensional 
Vector Spaces 


Prove that if (v1,-..,Un) spans V, then so does the list 
(vi — U2, V2 — U3,- -3 Un—1 — Un, Un) 


obtained by subtracting from each vector (except the last one) the following 
vector. 


SOLUTION: Suppose (v1,..., Un) spans V. Let v € V. To show that 


v € span(vı — V2, U2 — U3,---;Un—1 — Un; Un), we need to find ai,...,@n E F 
such that 


v = a (v — v2) + az(v2 — v3) +... an—1(Un-1 — Un) + ann. 


Rearranging terms of the equation above, we see that we need to find 
Q1,...,;@, E F such that 


(a) = v= amı + (az — a1) v2 + (a3 — ag)ug + +++ + (an — On_1)Up- 
Because (v1, ..., Un) spans V, there exist b),...,6, € F such that 
(b) v = biv; + bova + b3u3 + +++ + bpp. 


Comparing equations (a) and (b), we see that (a) will be satisfied if we 
choose a; to equal b, and then choose az to equal bz + a; and then choose 
a3 to equal b3 + a2, and so on. 
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2. Prove that if (v1, .-., Un) is linearly independent in V, then so is the list 
(v1 — v2, V2 — U3,..+,Un—1 — Uns Un) 


obtained by subtracting from each vector (except the last one) the following 
vector. 


SOLUTION: Suppose (v1,...,Un) is linearly independent in V. To prove 
that the list displayed above is linearly independent, suppose a1,...,a, € F 
are such that 


ai (v1 — v2) + ag(v2 — v3) + +++ + Gn-1(Un-1 — Un) + anUn = 0. 
Rearranging terms, the equation above can be rewritten as 
aiv, + (a2 — a1) v2 + (a3 — a2)v3 +-++ + (Gn — an—-1)Un = 0. 


Because (v1,..., Un) is linearly independent, the equation above implies that 


a; =0 
a2—a,;=0 
a3—a,=0 


an — Qn; =Q. 


The first equation above tells us that a; = 0. That information, combined 
with the second equation, tells us that ag = 0. That information, combined 
with the third equation, tells us that a3 = 0. Continue in this fashion, 
getting a) = -+ = an = 0. Thus (vı — vo, v2 — ¥3,...,Un—1 — Un; Un) is 
linearly independent. 


3. Suppose (v1,...,Un) is linearly independent in V and w € V. Prove that if 
(vi +w,...,Un + w) is linearly dependent, then w € span(v1,..., Un). 


SOLUTION: Suppose (vı + w,...,U, + w) is linearly dependent. Then 
there exist scalars a),...,@,,, not all 0, such that 


ailu +w) +--+ anun + w) = 0. 
Rearranging this equation, we have 


aV +--+ +@ntp = — (ai +---+an)w. 
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If ay +---+an were 0, then the equation above would contradict the linear 
independence of (v1,...,Un). Thus a; +--+- + an # 0. Hence we can divide 
both sides of the equation above by —(a; + ---+ an), showing that w € 
span(v),..., Un). 


4. Suppose m is a positive integer. Is the set consisting of 0 and all polynomials 
with coefficients in F and with degree equal to m a subspace of P(E)? 


SOLUTION: The set consisting of 0 and all polynomials with coefficients 
in F and with degree equal to m is not a subspace of P(F) because it is not 
closed under addition. Specifically, the sum of two polynomials of degree m 
may be a polynomial with degree less than m. For example, suppose m = 2. 
Then 7 + 4z + 52” and 1 + 2z — 5z? are both polynomials of degree 2 but 
their sum, which equals 8 + 6z, is a polynomial of degree 1. 


5. Prove that F© is infinite dimensional. 


SOLUTION: For each positive integer m, let em be the element of F© 
whose mt? coordinate equals 1 and whose other coordinates equal 0: 


em = (0,..-,0,1,0,...). 
f 


m*? coordinate 
Then (e1,...,€m) is a linearly independent list of vectors in F”, as is easy 
to verify. This implies, by the marginal comment attached to 2.6, that F© 
is infinite dimensional. 


6. Prove that the real vector space consisting of all continuous real-valued 
functions on the interval [0, 1] is infinite dimensional. 


SOLUTION: Let V denote the real vector space of all continuous real- 
valued functions on the interval [0,1]. For each positive integer m, the list 
(1,z,...,2™) is linearly independent in V (because if ag,...,@m € R are 
such that 


ao + ayr+---+an2™" =0 


for every x € [0,1], then the polynomial above has infinitely many roots 
and hence all its coefficients must equal 0). This implies, by the marginal 
comment attached to 2.6, that V is infinite dimensional. 
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7. Prove that V is infinite dimensional if and only if there is a sequence 
v1, U2,... of vectors in V such that (v),...,vn) is linearly independent for 
every positive integer n. 


SOLUTION: First suppose that V is infinite dimensional. Choose v; 
to be any nonzero vector in V. Choose vo, v3,... by the following induc- 
tive process: suppose that v1,...,U,—1 have been chosen; choose any vector 
Un E V such that vn ¢ span(v,..., Up—1)—because V is not finite dimen- 
sional, span(v1,...,Un—1) cannot equal V so choosing vpn in this fashion is 


possible. The linear dependence lemma (2.4) implies that (v,...,Un) is 
linearly independent for every positive integer n, as desired. 
Conversely, suppose there is a sequence vj, v2,... of vectors in V such 


that (v1,..., Un) is linearly independent for every positive integer n. This 
implies, by the marginal comment attached to 2.6, that V is infinite dimen- 
sional. 


8. Let U be the subspace of R® defined by 
U = {(£1, £2, £3, £4, £5) € RË : zı = 329 and T3 = Tz4}. 
Find a basis of U. 
SOLUTION: Obviously 
U = {(329, £2, 114, £4, T5) : £2, T4, £5 E R}. 
From this representation of U, we see easily that 
((3, 1,0,0, 0), (0,0, 7, 1,0), (0,0, 0,0, 1)) 


is a basis of U. 
Of course there are also other possible choices of bases of U. 


9. Prove or disprove: there exists a basis (po, p1, p2, p3) of P3(F) such that none 
of the polynomials po, p1, p2, p3 has degree 2. 


SOLUTION: Define po, pı, p2, p3 € P3(F) by 
po(z) = 1, 
pi(z) =z, 
po(z) = 27 +23, 


p3(z) = 2°. 
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10. 


11. 


12. 


13. 


None of the polynomials po, pı, p2,p3 has degree 2, but (po, P1, p2, p3) is a 
basis of P3(F), as is easy to verify. 

Of course there are also other possible choices of bases of P3(F) without 
using polynomials of degree 2. 


Suppose that V is finite dimensional, with dim V = n. Prove that there 
exist one-dimensional subspaces Uj,...,U, of V such that 


V=U\6:---@®U,. 


SOLUTION: Let (v1,...,0n) be a basis of V. For each j, let U; equal 
span(v;); in other words, U; = {avj : a € F}. Because (v, ...,Un) is a basis 
of V, each vector in V can be written uniquely in the form 


QU +:+-+GnUn, 


where a1,...,@n E F (see 2.8). By definition of direct sum, this means that 
V=(6:---@U,. 


Suppose that V is finite dimensional and U is a subspace of V such that 
dim U = dim V. Prove that U = V. 


SOLUTION: Let (u1,... Un) be a basis of U. Thus n = dimU, and 
by hypothesis we also have n = dimV. Thus (uj,...,un) is a linearly 
independent (because it is a basis of U) list of vectors in V with length 
dim V. From 2.17, we see that (u1,..., Un) is a basis of V. In particular 
every vector in V is a linear combination of (u1,...,un). Because each 
u; E U, this implies that U = V. 


Suppose that po, P1, .--, Pm are polynomials in P,,(F) such that p;(2) =0 
for cach j. Prove that (po, P1, ---, Pm) is not linearly independent in Pm(F). 


SOLUTION: Because p;(2) = 0 for each j, the constant polynomial 1 is 
not in span(po, - - -, Pm). Thus (po,...,Pm) is not a basis of P,,(F). Because 
(po,---, Dm) is a list of length m +1 and P,,(F) has dimension m + 1, this 
implies (by 2.17) that (po,...,pm) is not linearly independent. 


Suppose U and W are subspaces of RÈ such that dimU = 3, dimW = 5, 
and U + W = RÈ. Prove that U NW = {0}. 


SOLUTION: We know (from 2.18) that 


dim(U + W) = dimU + dim W - dim(UNW). 
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14. 


15. 


Because dim(U + W) = 8, dimU = 3, and dimW = 5, this implies that 
dim(U NW) =0. Thus U NW = {0}. 


Suppose that U and W are both five-dimensional subspaces of RÌ. Prove 
that U NW + {0}. 


SOLUTION: Using 2.18 we have 


9 > dim(U +W) 
= dimU + dim W — dim(UNW) 
= 10 —dim(UNW). 
Thus dim(U NW) > 1. In particular, U NW Æ {0}. 


You might guess, by analogy with the formula for the number of elements 
in the union of three subsets of a finite set, that if U1, U2, U3 are subspaces 
of a finite-dimensional vector space, then 


dim(U, + U2 + U3) 
=dim U, + dim U, + dim U3 
— dim(U; N U2) — dim(U; N U3) — dim(U2 N U3) 
+ dim(U;, NVN U3). 


Prove this or give a counterexample. 
SOLUTION: To give a counterexample, let V = R?, and let 


Ui = {(z,0) 12E R}, 
Uz = { (0,4): y E R}, 
U; = {(z,2): rE R}. 


Then U; + U2 + U3 = R?, so dim(U, + Uz + U3) = 2. However, 
dim U; = dim U2 = dim U3 = 1 
and 
dim(Uy N U2) = dim(U, N Us) = dim(Uz N U3) = dim(U, N U2 N U3) = 0. 


Thus in this case our guess would reduce to the formula 2 = 3, which 
obviously is false. 
Of course there are also many other examples. 
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Prove that if V is finite dimensional and Uj,...,U,;, are subspaces of V, 
then 


dim(U; +--+ Um) < dim U; + --- + dim Um- 


SOLUTION: For each j = 1,...m, choose a basis for Uj. Put these 
bases together to form a single list of vectors in V. Clearly this list spans 
U,+---+Um. Hence the dimension of U1 +---+U,, is less than or equal to the 
number of vectors in this list (by 2.10), which equals dim U} + ---+dim Um. 
In other words, 


dim(U; + --- + Um) < dim U; +- -- + dim Um. 


Suppose V is finite dimensional. Prove that if U}, ..., Um are subspaces of V 
such that V = U1 @ ---® Um, then 


dim V = dim U, + --- + dim Um. 


COMMENT: This exercise deepens the analogy between direct sums of 
subspaces and disjoint unions of subsets. Specifically, compare this exercise 
to the following obvious statement: if a finite set is written as a disjoint 
union of subsets, then the number of elements in the set equals the sum of 
the number of elements in the disjoint subsets. 


SOLUTION: Suppose that Uj,...,Um are subspaces of V such that V = 
Ui @---®Um. For each j = 1,...m, choose a basis for U;. Put these bases 
together to form a single list B of vectors in V. Clearly B spans U;+---+Um, 
which equals V. If we show that B is also linearly independent, then it will 
be a basis of V. Thus the dimension of V will equal the number of vectors B. 
In other words, we will have 


dim V = dimU,; + --- + dim Um, 


as desired. s 

We still need to show that B is linearly independent. To do this, suppose 
that some linear combination of B equals 0. Write this linear combination 
as u1 +---+Um, where we have grouped together the terms that come from 
the basis vectors of U, and called their sum wu, and similarly up to um. 
Thus we have 


uy t-e +Hum = 90, 


where each u; € U;. Because V = U; @--- ® Um, this implies that each uj 
equals 0. Becausc each u; is a linear combination of our basis of U;, all the 
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coefficients in the linear combination defining u; must equal 0. Thus all the 
coefficients in our original linear combination of B must equal 0. In other 
words, B is linearly independent, completing our proof. 


CHAPTER 3 


Linear Maps 


Show that every linear map from a one-dimensional vector space to itself is 
multiplication by some scalar. More precisely, prove that if dim V = 1 and 
T € L(V, V), then there exists a € F such that Tv = av for all v € V. 


SOLUTION: Suppose dim V = 1 and T € L(V,V). Let u be any nonzero 
vector in V. Then every vector in V is a scalar multiple of u. In particular, 
Tu = au for some a € F. 

Now consider a typical vector v € V. There exists b € F such that 
v = bu. Thus 


Tv = T(bu) 
= bT(u) 
= b(au) 
= a(bu) 


= av. 
Give an example of a function f: R? > R such that 
Ff (av) = af(v) 
for alla € R and all v € R? but f is not linear. 
SOLUTION: Define f: R? > R by 
f(z,y) = (2? + 3°)”. 


Then f(av) = af(v) for all a € R and all v € R?. However, f is not linear 
because f(1,0) = 1 and f(0,1) = 1 but 
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f((1,0) + (0,1)) = f(1, 1) 
2 91/3 
# f(1,0) + f (0,1). 
Of course there are also many other examples. 


COMMENT: This exercise shows that homogeneity alone is not enough 
to imply that a function is a linear map. Additivity alone is also not enough 
to imply that a function is a linear map, although the proof of this involves 
advanced tools that are beyond the scope of this book. 


3. Suppose that V is finite dimensional. Prove that any linear map on a sub- 
space of V can be extended to a linear map on V. In other words, show that 
if U is a subspace of V and S € L(U, W), then there exists T € L(V, W) 
such that Tu = Su for all u E€ U. 


SOLUTION: Suppose U is a subspace of V and S € L(U,W). Let 
(ui, -.- Um) be a basis of U. Then (u1, ..., um) is a linearly independent list 
of vectors in V, and so can be extended to a basis (u1,..., Um; Ul, -.., Un) 
of V (by 2.12). Define T € L(V, W) by 


T(arur +... amum + bv; + -. .bnUn) = a1 Suy + -+-+ amSum- 
Then Tu = Su for allu € U. 
COMMENT: Defining T: V — W by 


Sv ifveu; 
Tv = 
0 ifvg¢U. 


does not work because this map is not linear. 


4, Suppose that T is a linear map from V to F. Prove that if u E V is not in 
null 7’, then 


V = nullT @ {au:a € F}. 


SOLUTION: Suppose u € V is not in nullT. Ifa € F and au € nullT, 
then 0 = T (au) = aTu, which implies that a = 0 (because Tu 4 0). Thus 


null TN {au: a € F} = {0}. 


Ifv EV, then 
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= gees ee 
oe Tu Tu” 


Note that T (v — 2u) = Tv-Z2Tu = 0. Thus the equation above expresses 


Tu 
an arbitrary vector v € V as the sum of a vector in nullT and a scalar 


multiple of u. Hence V = nullT + {au:a € F}. Using 1.9, we conclude 
that V = null T @ {au : a € F}. 


Suppose that T € L(V, W) is injective and (v1,..., Un) is linearly indepen- 
dent in V. Prove that (Tv1, ..., Tun) is linearly independent in W. 


SOLUTION: To show that (Tv,..., Tun) is linearly independent, suppose 
a,,.-.;@, E F are such that 


aiTvj +---+anTv, = 0. 

Because T is a linear map, this equation can be rewritten as 
T(aiv + +++ + anun) = 0. 

Because T is injective, this implies that 


aiUi +-+ antn = 0. 


Because (v1, -- - , Un) is linearly independent, the equation above implies that 
a, =---=a, = 0. Thus (Tv, ..., Tun) is linearly independent. 
Prove that if S1,...,S, are injective linear maps such that S)... Sa makes 


sense, then S4... Sn is injective. 


SOLUTION: Suppose that S),...,5, are injective linear maps such that 
S, ...S,, makes sense (which means that the domains of 5,..., Sn are such 
that Sı ...S, is well defined). Suppose v is,a vector in the domain of S4 . .. Sn 
(which equals the domain of S,) such that 


(Sı . ..-Sn)v = 0. 


To show that Sı... Sn is injective, we need to show that v = 0 (see 3.2). To 
do this, rewrite the equation above as 


Si ((S2--. Sn)v) = 0. 


Because S, is injective, this implies that 
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(S2 oe -Sn)v = 0. 
The same argument, now applied to the equation above, shows that 
(S3 eee Sn)v = 0. 


Repeat this process until reaching the equation S,v = 0, which implies 
(because Sn is injective) that v = 0, as desired. 


7. Prove that if (v,...,0n) spans V and T € L(V,W) is surjective, then 
(Tv,...,Tvn) spans W. 


SOLUTION: Suppose that (v1,...,Un) spans V and T € L(V, W) is sur- 
jective. Let w € W. Because T is surjective, there exists v € V such that 
Tv = w. Because (vj,...,Un) spans V, there exist a1,...,@, € F such that 


V = Q)U, +°--+Gntn. 
Applying T to both sides of this equation, we get 


Tu = ayTv +--+ + anT vp. 


Because Tv = w, the equation above implies that w € span(Tv,...,T un). 
Because w was an arbitrary vector in W, this implies that (Tm,..., Tun) 
spans W. 


8. Suppose that V is finite dimensional and that T € L(V,W). Prove that 
there exists a subspace U of V such that U N nullT = {0} and rangeT = 
{Tu: u € U}. 


SOLUTION: There exists a subspace U of V such that 
V=nullT eU; 


this follows from 2.13 (with nullT playing the role of U and U playing the 
role of W). 

From the definition of direct sum, we have U N nullT = {0}. 

Obviously rangeT D {Tu : u € U}. To prove the inclusion in the other 
direction, suppose v € V. Then there exist w € nullT and u’ € U such that 


v=w+u. 


Applying T to both sides of this equation, we have Tv = Tw + Tu! = Tu’. 
Thus Tv € {Tu : u € U}. Because v was an arbitrary vector in V (and thus 
Tv is an arbitrary vector in rangeT), this implies that 
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rangeT C {Tu: ue U}. 
Thus rangeT = {Tu : u € U}, as desired. 
Prove that if T is a linear map from F4 to F? such that 
null T = {(x1,22,23,24) € Fî : 2) = 572 and T3 = 7x4}, 
then T is surjective. | 


SOLUTION: Suppose T € L( F4,F?) is such that nullT is as above. 
Then ((5, 1,0,0), (0,0,7,1)) is a basis of nullT, and hence dim nullT = 2. 
From 3.4 we have 


dim range T = dim F^ — dim null T 
=4-2 
= 2, 


Because range T is a two-dimensional subspace of R?, we have range T = R2. 
In other words, T is surjective. 


Prove that there does not exist a linear map from F® to F? whose null space 
equals 


{(x1, 22,23, 24,25) € F®: 2, = 322 and T3 = 24 = Ts}. 
SOLUTION: Suppose U is the subspace of F5 displayed above. Then 
((3, 1,0, 0,0), (0,0, 1,1, 1)) is a basis of U, and hence dimU = 2. 
If T € £(F°, F?) then from 3.4 we have 


dim null T = dim Fë — dim range T 


= 5 — dim range T 
>3 
> dim U, 


where the first inequality holds because rangeT C F?. The inequality above 
shows that if T € £(F5, F?), then null T Æ U, as desired. 


Prove that if there exists a linear map on V whose null space and range are 
both finite dimensional, then V is finite dimensional. 


SOLUTION: Suppose there exists a linear map T from V into some vec- 
tor space such that nullT and rangeT are both finite dimensional. Thus 
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there exist vectors u1,..-,Um E V and w,..., Wn E rangeT such that 
(u1,-.-;Um) Spans nullT and (w1,..., Wn) spans rangeT. Because each 
wj € rangeT, there exists vj € V such that w; = Tuj. 

Suppose v € V. Then Tv € rangeT, so there exist bi,...,bn € E such 
that 


Tv = bijw; ++--+ byway 
= 6 Tuy +--+ + bnT un 
= T (6,2, +... baUn). 


The equation above implies that T(v — biv — --- — bnUn) = 0. In other 
words, v — bivi — --- — bnaUn € nullT. Thus there exist aj,...,a@m E F such 
that 

v— bivi =, eee — bava = ayu +++ + amum- 


The equation above can be rewritten as 
v = aju +--+ amum + bivi +--+ + dnp. 


The equation above shows that an arbitrary vector v € V is a linear com- 
bination of (u1,...,Um,V1,---,Un). In other words, (u1,..-,tUm,U1,--+;Un) 
spans V. Thus V is finite dimensional. 


COMMENT: The hypothesis of 3.4 is that V is finite dimensional (which 
is what we are trying to prove in this exercise), so 3.4 cannot be used in this 
exercise. 


Suppose that V and W are both finite dimensional. Prove that there exists 
a surjective linear map from V onto W if and only if dim W < dim V. 


SOLUTION: First suppose that there exists a surjective linear map T 
from V onto W. Then 


dim W = dimrangeT 
= dim V — dim null T 
< dim V, 


where the second equality comes from 3.4. 

To prove the other direction, now suppose that dim W < dim V. Let 
(u1,...,Wm) be a basis of W and let (v1, ..., Un) be a basis of V. For 
a1,...,@n € F define T (aiv, + --- + antn) by 


23 


CHAPTER 3. Linear Maps 


13. 


14. 


T(aı +t QnUn) = QW +--+ +amWm- 


Because dim W < dim V, we have m < n and so am on the right side of the 
equation above makes sense. Clearly T is a surjective linear map from V 
onto W. 


Suppose that V and W are finite dimensional and that U is a subspace 
of V. Prove that there exists T € £(V,W) such that null T = U if and only 
if dimU > dim V — dim W. 


SOLUTION: First suppose that there exists T € L(V,W) such that 
null T = U. Then 


dim U = dim null T 
= dim V — dimrangeT 
> dim V — dim W, 


where the second equality comes from 3.4. 

To prove the other direction, now suppose that dim U > dim V — dim W. 
Let (u1,...,Um) be a basis of U. Extend to a basis (uj,...,Um,V1,---;Un) 
of V. Let (wi, ..., Wp) be a basis of W. For a1,...,@m,b1,..-,bn € F define 
Tharu, + +++ + QmUm + bv; +--+ bntun) by 


T(ayuy +- + amum + biv +e + bntun) = biwi +--+ + baw. 


Because dim W > dim V —dimJU, we have p > n and so wpn on the right side 
of the equation above makes sense. Clearly T € L(V, W) and nullT = U. 


Suppose that W is finite dimensional and T € C(V,W). Prove that T is 
injective if and only if there exists S € C(W,V) such that ST is the identity 
map on V. 


SOLUTION: First suppose that T is injective. Define S’: rangeT — V 
by 


S'(Tv) = v; 


because T is injective, each element of rangeT can be represented in the 
form Tv in only one way, so T is well defined. As can be easily checked, S’ 
is a linear map on range T. By Exercise 3 of this chapter, S’ can be extended 
to a linear map S € L(W, V). If v € V, then (ST)v = S(Tv) = S' (Tv) =v. 
Thus ST is the identity map on V, as desired. 
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To prove the implication in the other direction, now suppose that there 
exists S € L(W, V) such that ST is the identity map on V. If u,v € V are 
such that Tu = Tv, then 


u = (ST)(u) = S(Tu) = S(Tv) = (ST)v =v 
and hence u = v. Thus T is injective, as desired. 


Suppose that V is finite dimensional and T € L(V,W). Prove that T is 
surjective if and only if there exists S$ € £(W, V) such that TS is the identity 
map on W. 


SOLUTION: First suppose that T is surjective. Thus W, which equals 
rangeT’, is finite dimensional (by 3.4). Let (wi,..., Wm) be a basis of W. 
Because T is surjective, for each j there exists vj E€ V such that w; = Tuz. 
Define S € L(W, V) by 


S(arw + +--+ amWm) = a01 +--+ + amum- 
Then 


(TS)(aywy +--+ +amWwm) = T (aiv +--+ + @mUm) 
= Tv + -ee H amI Um 
= aW, He H amWm. 
Thus T'S is the identity map on W. 
To prove the implication in the other direction, now suppose that there 
exists S € L(W, V) such that TS is the identity map on W. If w € W, then 


w = T(Sw), and hence w € range T. Thus rangeT = W. In other words, T 
is surjective, as desired. 


Suppose that U and V are finite-dimensional vector spaces and that S € 
L(V,W), T € L(U, V). Prove that 


dim null ST < dim null S + dim null T. 


SOLUTION: Define a linear map T’: null ST — V by T'u = Tu. If 
u € null ST, then S(Tu) =-0, which means that Tu € null S. In other 
words, range T” C null S. Now 


dim null ST = dimnull T’ + dim range T” 
< dimnull T” + dim null S 
< dim null T + dim null S, 


25 CHAPTER 3. Linear Maps 


17. 


18. 


where the first line follows from 3.4 (applied to T”), the second line holds 
because range T” C null S, and the third line holds because of the obvious 
inclusion null 7” C null T. 


Prove that the distributive property holds for matrix addition and matrix 
multiplication. In other words, suppose A, B, and C are matrices whose 
sizes are such that A(B+C) makes sense. Prove that AB + AC makes sense 
and that A(B + C) = AB + AC. i 


SOLUTION: Because A( B+C) makes sense, B and C must have the same 
size. Furthermore, the number of columns of A (let’s call this number n) 
must equal the number of rows of B and C. All this means that AB + AC 
makes sense. 

To prove that A(B + C) = AB + AC, just use the definition of matrix 
addition, the definition of matrix multiplication, and the usual distributive 
property for elements of F. Specifically, let jks bjk, and cj; denote the 
entries in row j, column k of A, B, and C, respectively. The entry in row j, 
column k of B +C is bjk + cj. Thus the entry in row j, column k of 
A(B+C) is 


n 
So air (bre + crk), 
r=1 


which equals 


n n 
> Ajrbek + > Qj rCr,ks 
r=l 


r=1 
which equals the entry in row j, column k of AB + AC, as desired. 


Prove that matrix multiplication is associative. In other words, suppose A, 
B, and C are matrices whose sizes are such that (AB)C makes sense. Prove 
that A(BC) makes sense and that (AB)C = A(BC). 


SOLUTION: This exercise can be done by a brute force calculation, in 
the style of the solution to the previous exercise. Here is a solution that uses 
only the associativity of the product of linear maps (which is easy to verify 
because composition of functions is clearly associative) and the nice property 
that the matrix of the product of two linear maps equals the product of the 
matrices of the two linear maps (see 3.11). 

Suppose A is an m-by-n matrix, B is an n-by-p matrix, and C is a p-by-q¢ 
matrix; the sizes much match up like this in order for (AB)C to make sense. 
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Let R € L(F", F™), S € L(F?, F"),T € £(F4, FP) be such that, with respect 
to the standard bases, M(R) = A, M(S) = B, M(T) = C; 3.19 insures that 
such linear maps exist. Now 


(AB)C = (M(R)M(S))M(T) 
= M(RS)M(T) 
= M((RS)T) 
= M(R(ST)) 
= M(R)M(ST) 
= M(R) (M (S)M (T)) 
= A(BC). 
19. Suppose T € £(F", F™) and that 
@1,1 see Alin 
M(T) = : : , 
aml «+. amn 
where we are using the standard bases. Prove that 
T(z1,..., En) = (01,121 +--+ + alnn,- -âm 121 +++: + amnTn) 
for every (£1,..., £n) € E”. 
COMMENT: This exercise shows T has the form promised on page 39. 


SOLUTION: Let g = (2j,...,2n) € F". Using the standard bases, we 
then have 


M(Trz) = M(T)M(z) 


Qll «+» Gin Tı 


am, --- amn Tn 


Q111 +--+ + alnn 


am, 11 +--+ Am, nTn 


where the first equality comes from 3.14. The last equation implies that 
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Tx = (01,121 +--+ +@1nEn,---,@mi21 +--+ AmnZn), 
as desired. 


20. Suppose (v1,..., Un) is a basis of V. Prove that the function T: V 3 
Mat(n, 1, F) defined by 


Tv = M(v) 


is an invertible linear map of V onto Mat(n, 1, F); here M(v) is the matrix 
of v € V with respect to the basis (u,..., Un). 


SOLUTION: Suppose u,w € V. We can write 
U= AVY +++: +Onvn and w= biv +--+ + bpp 
for some @j,...,@n,61,..-,6n, E€ F. Thus 
u +w = (a; +bi)vi +--+ (an + bn)Un- 
Hence 


T(u + w) = M(u + w) 


ai +b; 

an + bn 

ai bi 
aa eae Crea (abe 

an bn 
= M(u) + M(w) 
= Tu + Tw, 


which shows that T satisfies that additivity property required for linearity. 
If ce F, then 


CU = COV) ++- + CAOnUn. 


Hence 
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T (cu) = M(cu) 
cai 
Can 
ay 
an 
=cM(u) 
= cTu, 


which shows that T satisfies the homogeneity property required for linearity. 
Thus T is linear. 

If Tu = 0, then a, = --- = an = 0, which implies that u = 0. Thus T is 
injective. 

If cj,...,¢n E FE, then 


Tev tiet Cnn : |, 


which implies that T is surjective. 
Because the linear map T is injective and surjective, it is invertible 
(see 3.17). 


Prove that every linear map from Mat(n,1,F) to Mat(m, 1, F) is given by 
a matrix multiplication. In other words, prove that if 


TE L(Mat(n, l, F), Mat(m, 1, F)), 


then there exists an m-by-n matrix A such that TB = AB for every B € 
Mat(n, 1, F). 


SOLUTION: The vector spaces Mat(n,1,F) and Mat(m,1,F) have ob- 
vious bases (consisting of matrices that have 0 in all entries except for a 1 
in one entry). Let A be the matrix of T with respect to these bases. Note 
that if B € Mat(n,1,F), then M(B) = B and M(TB) = TB. Thus 
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TB = M(TB) 
= M(T)M(B) 
= AB, 


where the second equality comes from 3.14. 


Suppose that V is finite dimensional and S,T € L(V). Prove that ST is 
invertible if and only if both S and T are invertible. 


SOLUTION: First suppose ST is invertible. Thus there exists R € L(V) 
such that R(ST) = (ST)R = I. If v € V is such that Tv = 0, then 


v = Iv 
= R(ST)v 
=Q. 


Because v was an arbitrary vector in null T, this shows that null T = {0}. 
Thus T is injective (by 3.2), and hence T is invertible (by 3.21), as desired. 
If u € V, then 


u = Iu 
= (ST) Ru 
= S(T Ru), 
which shows that u € range S. Because u was an arbitrary vector in V, this 
implies that range S = V. Thus V is surjective, and hence V is invertible 
(by 3.21), as desired. 


To prove the implication in the other direction, now suppose that both 
S and T are invertible. Then 


(ST)(T 1S!) = S(TT~)s7} 
=ss-} 
=I 


and 
(T-'s—1)(ST) = TS- S)T 


=T'T 
=I. 
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Thus T~!S~—! satisfies the properties required for an inverse of ST. Thus 
ST is invertible and (ST)~! = T7197}. 


Suppose that V is finite dimensional and S,T € L(V). Prove that ST = I 
if and only if TS = I. 


SOLUTION: First suppose that 
Sr =; 


Because J is invertible, the previous exercise implies that S and T are both 
invertible. Multiply both sides of the equation above by T—! on the right, 
getting 


SET. 
Now multiply both sides of the equation above by T on the left, getting 
TS =I, 


as desired. 

To prove the implication in the other direction, simply reverse the roles 
of S and T in the direction we have already proved, showing that if TS = J, 
then ST = I. 


Suppose that V is finite dimensional and T € L(V). Prove that T is a scalar 
multiple of the identity if and only if ST = TS for every S € L(V). 


SOLUTION: First suppose that T = aI for some a € F. Let S € L(V). 
Then 


To prove the implication in the other direction, suppose now that ST = 
TS for all S € L(V). We begin by proving that (v, Tv) is linearly dependent 
for every v € V. To do this, fix v € V, and suppose that (v, Tv) is linearly 
independent. Then (v,Tv) can be extended to a basis (v, Tv, u,..., tn) 
of V. Define S € L(V) by 


S(av + bTvu + ciui +--+ + Cntin) = bv. 
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Thus S(Tv) = v and Sv = 0. Thus the equation S(Tv) = T(Sv) becomes 
the equation v = 0, a contradiction because (v, Tv) was assumed to be 
linearly independent. This contradiction shows that (v, Tv) is linearly de- 
pendent for every v € V. This implies that for each v € V \ {0}, there exists 
@y E F such that 


Tv = av. 


To show that T is a scalar multiple of the identity, we must show that a, 
is independent of v. To do this, suppose v, w € V \ {0}. We want to show 
that a, = ay. First consider the case where (v, w) is linearly dependent. 
Then there exists b € F such that w = bv. We have 

ayw = Tw 
= T(bv) 
= bTv 
= b(ayv) 
= QW, 


which shows that ay = aw, as desired. 
Finally, consider the case where (v, w) is linearly independent. We have 


Qvtu(v + w) = T(v + w) 
= Tv + Tw 


= AyV + QwwW, 
which implies that 
(autu — dy) + (dutw — @w)w = 0. 


Because (v,w) is linearly independent, this implies that ay+w = a, and 
Quiw = Ay, SO again we have a, = dy, as desired. 


Prove that if V is finite dimensional with dim V > 1, then the set of non- 
invertible operators on V is not a subspace of L(V). 


SOLUTION: Suppose that V is finite dimensional with dimV > 1. Let 
n = dim V and let (v1,...,un) be a basis of V. Define S,T € L(V) by 


S(aiv +--+ QnUn) = ayjvy 


and 
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T(ayvy +--+ + ann) = a202 + +--+ Ann. 


Then S is not injective because Sug = 0 (this is where we use the hypothesis 
that dim V > 1), and T is not injective because T'v; = 0. Thus both S and 
T are not invertible. However, S +T equals J, which is invertible. Thus the 
set of noninvertible operators on V is not closed under addition, and hence 
it is not a subspace of L(V). 


COMMENT: If dim V = 1, then the set of noninvertible operators on V 
equals {0}, which is a subspace of L(V). 


Suppose n is a positive integer and a; € F for i,j = 1,...,n. Prove that 
the following are equivalent: 


(a) The trivial solution zı = --- = £a = 0 is the only solution to the 
homogeneous system of equations 


n 
X arate =0 
k=l 


n 
y an, kTk = Q. 
k=1 


(b) For every c1,...,C€n € F, there exists a solution to the system of 
equations 


n 
X a14Ze = C) 
k=1 


n 
X en bok = Cp. 
k=1 


Note that here we have the same number of equations as variables. 


SOLUTION: Define T € £(F") by 


n n 
T(21,...,2n) = ($ 01 ktk- --; X ankTe). 
k=1 k=1 


Then (a) above is the assertion that T is injective, and (b) above is the 
assertion that T is surjective. By 3.21, these two assertions are equivalent. 


CHAPTER 4 


Polynomials 


Suppose m and n are positive integers with m < n. Prove that there exists 
a polynomial p € P,(F) with exactly m distinct roots. 


SOLUTION: Define p € P,,(F) by 
p(z) = (z -—1)"-™*1(z — 2)(z — 3)... (z — m). 


Then p is a polynomial of degree n with exactly m distinct roots (which are 
1,...,m). 


Suppose that z,,...,2m41 are distinct elements of F and W1,---,Wm4ti E F. 
Prove that there exists a unique polynomial p € Pm(F) such that 


p(z) = wj 
for j =1,...,m +1. 
SOLUTION: Define T: P,,(F) — F™+! by 
Tp = (p(z1),... »P(2m+1)). 


We need to prove that T is injective (which implies that at most one polyno- 
mial p satisfies the condition required by the exercise) and surjective (which 
implies that at least one polynomial p satisfies the condition required by the 
exercise). l 

Clearly T is a linear map. If p € nullT, then 


p(z1) = -+- = P(zm+1) = 0, 


which means that p is a polynomial of degree m with at least m + 1 distinct 
roots, which means that p = 0 (by 4.3). Thus p is injective, as desired. 
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Now 
dim range T = dim?P,,(F) — dim null T 
=(m+1)-0 
= dim F+, 
where the first equality comes from 3.4 and the second equality holds because 


nullT = {0}. The last equality above implies that rangeT = F™+!. Thus 
T is surjective, as desired. 


COMMENT: Surjectivity of T can also be proved by using an explicit 
construction. But linear algebra, ee 3.4, gives us surjectivity easily 
once we get injectivity. 


3. Prove that if p,q € P(E), with p Æ 0, then there exist unique polynomials 
s,r € P(F) such that 


q=sp+r 


and degr < degp. In other words, add a uniqueness statement to the 
division algorithm (4.5). 


SOLUTION: Suppose p,q € P(F), with p Æ 0. We know from the division 
algorithm (4.5) that there exist s,r € P(E), with degr < deg p, such that 


qg=sp+r. 
To prove that s and r are unique, suppose that §,7 are in P(F), with 
deg r < deg p and 
q=sp4+r. 
Subtracting the last two equations are redrranging, we have 
(§-s)p=r-—-F. 


The right side of the equation above is a polynomial whose degree is less 
than deg p. If š were not equal to s, then the left side of the equation above 
would be a polynomial whose degree is at least degp. Thus we must have 
5 = s, which, from the equation above, implies that 7 = r. Thus the choices 
of s and r were indeed unique. 
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4. Suppose p € P(C) has degree m. Prove that p has m distinct roots if and 
only if p and its derivative p’ have no roots in common. 


SOLUTION: First suppose that p has m distinct roots. Because p has 
degree m, this implies that p can be written in the form 


p(z) = c(z—)...(z = Am); 


where A;,...,Am are distinct. To prove that p and p' have no roots in 
common, we must show that p’(A;) Æ 0 for each j. To do this, fix j. The 
expression above for p shows that we can write p in the form 


p(z) = (2 — As)a(z), 


where q is a polynomial such that q(A;) 4 0. Differentiating both sides of 
this equation, we have 


p'(z) = (z — dj)q'(z) + a(z). 
Thus 
P'(Az) = alà) 
#0, 


as desired. 

To prove the other direction, we will proved the contrapositive, meaning 
that we will prove that if p has less than m distinct roots, then p and p' 
have at least one root in common. To do this, suppose that p has less than 
m distinct roots. Then for some root À of p, we can write p in the form 


p(z) = (z — A)" q(z), 


where n > 2 and q is a polynomial. Differentiating both sides of this equa- 
tion, we have 


p'(z) = (z — d)"q'(z) + n(z — A)?" q(2). 
Thus p’(A) = 0, and so À is a common root of p and p’, as desired. 


5. Prove that every polynomial with odd degree and real coefficients has a real 
root. 


SOLUTION: Suppose that p is a polynomial with odd degree and real 
coefficients. By 4.14, p is a constant times the product of factors of the form 
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z — and/or z? + az + p, where A,a, 8 € R. Not all the factors can be of 
the form T? + az +B, because otherwise p would have even degree. Thus at 
least one factor must be of the form z — A. Any such 4 is a real root of p. 


COMMENT: Here is another proof, using calculus but not using 4.14. 
Suppose p is a polynomial with odd degree m. We can write p in the form 


p(x) = ao +ayr+-+++amz™, 


where @,...,@m E R and am 4 0. Replacing p with —p if necessary, we 
can assume that am > 0. Now 


ao a am-—1 
p(z) = a™(— + = +++ tam). 
This implies that 
lim p(z)=—-oo and lim p(z) = oo. 
T=- z—00 


The intermediate value theorem now implies that there is a real number À 
such that p(A) = 0. In other words, p has a real root. 


CHAPTER 5 


Eigenvalues and EFigenvectors 


Suppose T € L(V). Prove that if U,,...,Um are subspaces of V invariant 
under T, then U, + ---+U,, is invariant under T. 


SOLUTION: Suppose U),...,Um are subspaces of V invariant under T. 
Consider a vector u € Uj; +---+Um. There exist uy € Uj,..-,tm € Um 
such that 


U>= U1 +--+ +m. 
Applying T to both sides of this equation, we get 
Tu = Tu, +---+Tum. 


Because each U; is invariant under T, we have Tu; € Uj,..., Tum € Um. 
Thus the equation above shows that Tu € U1 +- --+Um, which implies that 
Ui +--+- + Um is invariant under T. 


Suppose T € L(V). Prove that the intersection of any collection of subspaces 
of V invariant under T is invariant under T. 


SOLUTION: Suppose {Ua}aer is a collection of subspaces of V invariant 
under T; here I is an arbitrary index set. We need to prove that Neer Ux, 
which equals the set of vectors that are in Ua for every a € I, is invariant 
under T. To do this, suppose u € (ler Ua. Then u € Ua for every a ET. 
- Thus Tu € Ug for every a € T (because every Ua is invariant under T). 
Thus Tu € flaer Ua, which implies that Neer Ve is invariant under T. 


Prove or give a counterexample: if U is a subspace of V that is invariant 
under every operator on V, then U = {0} or U = V. 
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SOLUTION: We will prove that if U is a subspace of V that is invariant 
under every operator on V, then U = {0} or U = V. Actually we will prove 
the (logically equivalent) contrapositive, meaning that we will prove that 
if U is a subspace of V such that U # {0} and U Æ V, then there exists 
T € L(V) such that U is not invariant under T. To do this, suppose U is a 
subspace of V such that U # {0} and U # V. Choose u € U \ {0} (this is 
possible because U # {0}) and w € V \U (this is possible because U # V). 
Extend the list (u), which is linearly independent because u Æ 0, to a basis 
(u,v1,...,Un) of V. Define T € L(V) by 


T(au + bivi +... brun) = aw. 


Thus Tu = w. Because u € U but w ¢ U, this shows that U is not invariant 
under T, as desired. 


4. Suppose that S,T € L(V) are such that ST = TS. Prove that null(T — AI) 
is invariant under S for every \ € F. 


SOLUTION: Fix A€ F. Suppose v € null(T — AI). Then 


(T — AI)(Sv) = T Sv — ASv 


= STv — ASv 
= S(Tv — àv) 
= 0. 


Thus Sv € null(T — AI). Hence null(T — AI) is invariant under S. 
5. Define T € £(F?) by 
T(w,z) = (z,w). 
Find all eigenvalues and eigenvectors of T. 


SOLUTION: Suppose À is an eigenvalue of T. For this particular operator, 
the eigenvalue-eigenvector equation T(w, z) = A(w, z) becomes the system 
of equations 


z= Aw 


w = Xz. 


Substituting the value for z from the first equation into the second equation 
gives w = A?w. Thus 1 = à? (we can ignore the possibility that w = 0 


39 CHAPTER 5. Eigenvalues and Eigenvectors 


because if w = 0, then the first equation above implies that z = 0). Thus 
à= 1 or à = —1. The set of eigenvectors corresponding to the eigenvalue 1 
is 

{(w,w):w €F}; 
The set of eigenvectors corresponding to the eigenvalue —1 is 

{(w, —w) : w € F}. 

6. Define T € L(F°) by 
T (21, z2, z3) = (2z2, 0, 523). 


Find all eigenvalues and eigenvectors of T. 


SOLUTION: Suppose 4 is an eigenvalue of T. For this particular operator, 
the eigenvalue-eigenvector equation T (z1, z2, 23) = A(z1, 22, z3) becomes the 
system of equations 


229 = Az, 
0 = Az 
523 = Àz3. 


If à # 0, then the second equation implies that z2 = 0, and the first 
equation then implies that zı} = 0. Because an eigenvalue must have a 
nonzero eigenvector, there must be a solution to the system above with 
z3 #0. The third equation then shows that \ = 5. In other words, 5 is the 
only nonzero eigenvalue of T. The set of eigenvectors corresponding to the 
eigenvalue 5 is 


{(0, 0, z3) 123 € F}. 


If à = 0, the first and third equations above show that zz = 0 and z3 = 0. 
With these values for z2, 23, the equations above are satisfied for all values 
of zı. Thus 0 is an eigenvalue of T. The set of eigenvectors corresponding 
to the eigenvalue 0 is 


{(z1, 0, 0) :z27 € F}. 
7. Suppose n is a positive integer and T € L(F") is defined by 
T(@1,-.-,;2n) = (z1 +--+ + Tn,- £1 ++ En); 


in other words, T is the operator whose matrix (with respect to the standard 
basis) consists of all 1’s. Find all eigenvalues and eigenvectors of T. 
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SOLUTION: Suppose À is an eigenvalue of T. For this particular oper- 
ator, the eigenvalue-eigenvector equation Tr = Ax becomes the system of 


equations 
Ti H-t + TEn = Ar, 
T1 H- + Tn = Afp. 
Thus 
AZ] = +- = ÀTp- 
Hence either À = 0 or £] = -+ = Tn. 


Consider first the possibility that 4 = 0. In this case all the equations in 
the eigenvector-eigenvalue system of equations above become the equation 
Tı +++ Tn = 0. Thus we see that 0 is an eigenvalue of T and that the 
corresponding set of eigenvectors equals 


{(z1,.--;2n) E F” : £1 +-+ + En = 0}. 


Now consider the possibility that zı = --- = zy; let t denote the com- 
mon value of 71,...,Zn. In this case all the equations in the eigenvector- 
eigenvalue system of equations above become the equation nt = At. Hence A 
must equal n (an eigenvalue must have a nonzero eigenvector, so we can take 
t #0). Thus we see that n is an eigenvalue of T and that the corresponding 
set of eigenvectors equals 


{(T1,... En) € F” : 2) =- = En}. 
Because the eigenvector-eigenvalue system of equations above implies 
that A = 0 or zı = -+> = Zp, we see that T has no eigenvalues other 
than 0 and n. 


8. Find all eigenvalues and eigenvectors of the backward shift operator T € 
L(F°) defined by 


T (21, 22, 23,--.) = (22, 23,.-.). 


SOLUTION: Suppose À is an eigenvalue of T. For this particular oper- 
_ ator, the eigenvalue-eigenvector equation Tz = Az becomes the system of 
equations 
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10. 


22> Az, 
z3 = Az 
24 = Az 


From this we see that we can choose z; arbitrarily and then solve for the 
other coordinates: 


22> Az 
23 = Az => 2; 


Z4 = Az3 = Mz 


Thus each A € F is an eigenvalue of T and the set of corresponding eigen- 
vectors is 


{(w, Aw, Xw, Bw...):w E F}. 


Suppose T € £(V) and dimrangeT = k. Prove that T has at most k +1 
distinct eigenvalues. 


SOLUTION: Let 21,...,Am be the distinct eigenvalues of T', and let 
U1,--+;Um be corresponding nonzero eigenvectors. If À; 4 0, then 


T(v;/à;) = v. 


Because at most one of A1,..-., Am equals 0, this implies that at least m—1 of 


the vectors v1,...,Um are in rangeT. These vectors are linearly independent 
(by 5.6), which implies that 


m — 1 < dimrangeT = k. 
Thus m < k + 1, as desired. 


Suppose T € L(V) is invertible and A € F\ {0}. Prove that A is an eigenvalue 
of T if and only if } is an eigenvalue of T~. 


SOLUTION: First suppose that À is an eigenvalue of T. Thus there exists 
a nonzero vector v € V such that 


Tv = dv. 
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11. 


12. 


Applying T—! to both sides of the equation above, we get v = AT~1v, which 
is equivalent to the equation T~!v = tv. Thus } is an eigenvalue of T-!. 

To prove the implication in the other direction, replace T by T—! and À 
by 1 and then apply the result from the paragraph above. 


Suppose S,T € L(V). Prove that ST and TS have the same eigenvalues. 


SOLUTION: Suppose that À € F is an eigenvalue of ST. We want to 
prove that A is an eigenvalue of TS. Because À is an eigenvalue of ST, there 
exists a nonzero vector v E€ V such that 


(ST)uv = dv. 
Now 


(TS)(Tv) = T(STv) 
= T (àv) 
= ATv. 


If Tv Æ 0, then the equation above shows that À is an eigenvalue of TS, as 
desired. : 

If Tv = 0, then A = 0 (because S(Tv) = Av) and furthermore T is 
not invertible, which implies that TS is not invertible (by Exercise 22 in 
Chapter 3), which implies that A (which equals 0) is an eigenvalue of TS. 

Regardless of whether or not Tv = 0, we have shown that À is an eigen- 
value of TS. Because was an arbitrary eigenvalue of ST, we have shown 
that every eigenvalue of ST is an eigenvalue of T'S. 

Reversing the roles of S and T, we conclude that every eigenvalue of TS 
is also an eigenvalue of ST. Thus ST and TS have the same eigenvalues. 


Suppose T € L(V) is such that every vector in V is an eigenvector of T. 
Prove that T is a scalar multiple of the identity operator. 


SOLUTION: For each v € V, there exists a, € F such that 
Tv = av. 


Because T0 = 0, we can choose ag to be any number in F, but for v € V\ {0} 
the value of a, is uniquely determined by the equation above. 

To show that T is a scalar multiple of the identity, we must show that ay 
is independent of v for v € V \ {0}. To do this, suppose v, w € V \ {0}. We 
want to show that a, = ay. First consider the case where (v, w) is linearly 
dependent. Then there exists b € F such that w = bu. We have 
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13. 


14. 


ayw = Tw 
= T(bv) 
= bTv 
= b(ayv) 
= aW, 
which shows that a, = ay, as desired. 
Finally, consider the case where (v, w) is linearly independent. We have 
Qysu(v + w) = T(v + w) 
=Tv+Tw 
= QV + QwW, 
which implies that 
(Qu4w — @y)U + (Qu4w — Qw)w = 0. 
Because (v, w) is linearly independent, this implies that ay4,, = ay and 
Qy+w = Ay, SO again we have a, = aw, as desired. 
Suppose T € L(V) is such that every subspace of V with dimension 
dimV — 1 


is invariant under T. Prove that T is a scalar multiple of the identity oper- 
ator. i 


SOLUTION: Suppose that T is not a scalar multiple of the identity op- 
erator. By the previous exercise, there exists u € V such that u is not an 
eigenvector of T. Thus (u, Tu) is linearly independent. Extend (u, Tu) to a 
basis (u, Tu, v1, ..., Un) of V. Let 


U = span(u, y,..., Un). 


Then U is a subspace of V and dimU = dimV — 1. However, U is not 
invariant under T because u € U but Tu ¢ U. This contradiction to our 
hypothesis about T shows that our assumption that T is not a scalar multiple 
of the identity must have been false. 


Suppose S,T € L(V) and S is invertible. Prove that if p € P(E) is a 
polynomial, then 


p(STS“!) = Sp(T) S~. 


SOLUTION: First suppose m is a positive integer. Then 
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16. 


(STS")™ = (STS—!)(STS“)...(STS™') 
= $T(S"'s)T(S“'!8)...(S~'s)TS™ 
= ST” S`, 
which is our desired equation in the special case when p(z) = z™. Multiply- 
ing both sides of the equation above by a scalar and then summing a finite 


number of equations of the resulting form shows that POTES = Sp(T)S-?} 
for every polynomial p € P(E). 


Suppose F = C, T € L(V), p € P(C), anda € C. Prove that a is an 
eigenvalue of p(T) if and only if a = p(A) for some eigenvalue À of T. 


SOLUTION: First suppose that a is an eigenvalue of p(T). Thus p(T)—al 
is not injective. Write the polynomial p(z) — a in factored form: 


p(z) — a = e(z — ài)... (z — àm), 


where c, À1,..-, Àm E C. We can assume that c # 0 (otherwise p is a 
constant polynomial, in which case the desired result clearly holds). The 
equation above implies that 


p(T) — al =c(T — `I)... (T — AmI). 


Because p(T) — al is not injective, this implies that T — A;J is not injective 
for some j. In other words, some À; is an eigenvalue of T. The formula 
above for p(z) — a shows that p(A;) — a = 0. Hence a = p(A;), as desired. 

For the other direction, now suppose that a = p(X) for some eigenvalue 
à of T. Thus there exists a nonzero vector v € V such that 


Tv = dv. 


Repeatedly applying T to both sides of this equation shows that Ttv = AFv 
for every positive integer k. Thus 


p(T )v = p(A)v 


= av. 
Thus a is an eigenvalue of p(T). 


Show that the result in the previous exercise does not hold if C is replaced 
with R. 
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18. 


19. 


SOLUTION: Define T € L(R?) by T(z, y) = (—y,z). Define p € P(R) by 
p(z) = z?. Then p(T) = T? = —I, and hence —1 is an eigenvalue of p(T). 
However, T has no eigenvalues (as we saw on page 78 of the textbook; the 
point here is that eigenvalues are required to be real because we are working 
on a real vector space), so there does not exist an eigenvalue of T such 
that —1 = p(A). 

Of course there are also many other examples. 


Suppose V is a complex vector space and T € L(V). Prove that T has an 


invariant subspace of dimension j for each j = 1,...,dimV. 


SOLUTION: There is a basis (v),..., ’gimy) with respect to which T has 
an upper-triangular matrix (see 5.13). For each j = 1,...,dimV, the span 
of (v,...,v;) is a j-dimensional subspace of V that is invariant under T 
(by 5.12). 


Give an example of an operator whose matrix with respect to some basis 
contains only 0’s on the diagonal, but the operator is invertible. 


SOLUTION: Let T€ L(F?) be the operator whose matrix (with respect 
to the standard basis) is 
0 1 
104° 


Obviously this matrix has only 0’s on the diagonal, but T is invertible (be- 
cause TT = I, as is clear from squaring the matrix above). 
Of course there are also many other examples. 


COMMENT: This exercise and the next one show that 5.16 fails without 
the hypothesis that an upper-triangular matrix is under consideration. 


Give an example of an operator whose matrix with respect to some basis 
contains only nonzero numbers on the diagonal, but the operator is not 
invertible. 


SOLUTION: Define T € C(F?) to be the operator whose matrix (with 
respect to the standard basis) is 
1 1 
1 1f? 


Then T(1,0) = T(0,1) = (1,1), so T is not injective, so T is not invert- 
ible, even though the diagonal of the matrix above contains only nonzero 
numbers. 
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21. 


Of course there are also many other examples. 


Suppose that T € L(V) has dimV distinct eigenvalues and that S € L(V) 
has the same eigenvectors as T (not necessarily with the same eigenvalues). 
Prove that ST = TS. 


SOLUTION: Let n = dimV. There is a basis (ui,...,Un) of V con- 
sisting of eigenvectors of T (see 5.20 and 5.21). Letting 4,...,An be the 
corresponding eigenvalues, we have 


Tv; = Aj; 
for each j. Each v; is also an eigenvector of S, so 
Sv; = QAjUj 


for some a; € F. 
For each j, we have 


(ST)v; = S(Tv;) = A;Sv;j = ajAj0; 
and 

(TS)v; = T(Svj) = ajTv; = a5 A;0;. 
Because the operators ST and TS agree on a basis, they are equal. 
Suppose P € L(V) and P? = P. Prove that V = null P @ range P. 


SOLUTION: First suppose u € nullPMrangeP. Then Pu = 0, and 
there exists w € V such that u = Pw. Applying P to both sides of the 
last equation, we have Pu = P?w = Pw. But Pu = 0, so this implies 
that Pw = 0. Because u = Pw, this implies that u = 0. Because u was an 
arbitrary vector in null PNrange P, this implies that null PNrange P = {0}. 

Now suppose v € V. Then obviously 


v = (v — Pv) + Pu. 


Note that P(v — Pv) = Pv — P?v =0, so (v — Pv) € null P. Clearly Pu € 
range P. Thus the equation above shows that v € null P+range P. Because 
v was an arbitrary vector in V, this implies that V = null P + range P. 

We have shown that null P N range P = {0} and V = null P + range P. 
Thus V = null P @ range P (by 1.9). 
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Suppose V = U @ W, where U and W are nonzero subspaces of V. Find all 
eigenvalues and eigenvectors of Py,w. 


SoLUTION: Because V = U @ W, each vector v € V can be written 
uniquely in the form 


v=u+w, 


where u € U and w € W. Recall that if v is represented as above, then 
Puywv =u. 

Suppose \ € F is an eigenvalue of Pyw. Then there exists a nonzero 
vector v € V such that Pywu = Av. Writing this equation using the 
representation of v given above, we have u = A(u + w). Thus 


(1—A)u— Aw = 0. 


Because V = U @ W, if 0 is written as the sum of a vector in U and a vector _ 
in W, then both vectors must be 0. Thus the equation above implies that 
(1 — A)u = Aw = 0. Because u and w are not both 0 (because v # 0), this 
implies that À = 1 or À = 0. 

For v € V with representation as above, the equation Py wv = 0 is 
equivalent to the equation u = 0, which is equivalent to the equation v = w, 
which is equivalent to the statement that v € W. This means that 0 is an 
eigenvalue of Pyw (because W is a nonzero subspace of V) and that W 
equals the set of eigenvectors corresponding to the eigenvalue 0. 

For v € V with representation as above, the equation Puwv = v is 
equivalent to the equation v = u, which is equivalent to the statement that 
v €U. This means that 1 is an eigenvalue of Py w (because U is a nonzero 
subspace of V) and that U equals the set of eigenvectors corresponding to 
the eigenvalue 1. 


Give an example of an operator T € £(R*) such that T has no (real) eigen- 
values. 


SOLUTION: Define T € £(R*) by 
T (21, T2, T3, T4) = (-z2, Ti, —T4, z3). 
Suppose A € R. For this particular operator, the eigenvalue-eigenvector 


equation T(x1,22,13,24) = A(£1, T2, £3, £4) becomes the system of equa- 
tions 
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24. 


=T? = Ar 
T1 = Ax? 
—T4 = AL3 
T3 = ÀT4. 


Multiplying together the first two equations and also multiplying together 
the last two equations gives —x, rq = A?T1T3 and —g£3T4 = A? 7324. If either 
Tı Or T2 does not equal 0, then the first two equations show that neither of 
21,22 equals 0. Similarly, if either z3 or z4 does not equal 0, then the last 
two equations show that neither of z3, 24 equals 0. Thus if À is an eigenvalue 
of T, then there is a solution to the system of equations above with zr 22 #0 
or 2324 Æ 0. Either way, we conclude that —1 = 4”, which is impossible for 
any real number à. Thus T has no real eigenvalues. 


Suppose V is a real vector space and T € L(V) has no eigenvalues. Prove 
that every subspace of V invariant under T has even dimension. 


SOLUTION: Suppose U is a subspace of V that is invariant under T. 
Thus T|y € £(U). If dimU were odd, then T|y would have an eigenvalue 
A € R (by 5.26), so there would exist a nonzero vector u € U such that 


T|yu = Au. 


Obviously this would imply that Tu = Au, which would imply that A is 
an eigenvalue of T. But T has no eigenvalues, so dim U must be even, as 
desired. 


CHAPTER 6 


Inner-Product Spaces 


Prove that if x,y are nonzero vectors in R?, then 


(z,y) = llz{lllyll cos, 


where @ is the angle between x and y (thinking of x and y as arrows with 
initial point at the origin). Hint: draw the triangle formed by =, y, and z—y; 
then use the law of cosines. 


SOLUTION: Suppose that z,y are nonzero vectors in R? and @ is the 
angle between z and y. Consider the triangle formed by =, y, and z — y: 


The law of cosines states that 
lz- yll? = lizll? + llyl|? — 2lizllilyll cos 8. 


As usual, we compute the norm of a vector squared by taking the inner 
product of the vector with itself: 


lz- yl? = (z - y, z - y) 
= (z, z) = (x, y) at (y, x) + (y, y) 
= |[x\\? + llyll? — 2(z, y). 
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Substitute the last expression for ||z — y||? into the left side of the law of 
cosines, obtaining 


lzi? + llyll? — 2(2,9) = Izl? + llull? — 2llzliliyll cos 4. 


Now subtract ||z||? + ||y||? from both sides of the equation above, and then 
divide both sides by —2, obtaining 


(z,y) = [[z\Illyl] cos 8. 
Suppose u,v € V. Prove that (u,v) = 0 if and only if 
llull < llu + avl| 
for allae F. 


SOLUTION: First suppose that (u,v) = 0. Let a € F. Then u,av are 
orthogonal. The Pythagorean theorem thus implies that 


llu + avl]? = Ijul? + llavi]? 
> ||ull?. 


Taking square roots gives ||u|| < ||u + av||, as desired. 
To prove the implication in the other direction, now suppose that |u|] < 
lu + av|| for all a € F. Squaring this inequality, we get 


lul? < llu + avli? 
= (u + av, u + av) 
= (u u) + (u, av) + (av, u) + (av, av) 
= |lull? +ā(u,v) + alu, v) + lal?llol]? 
= |lul? + 2Rea(u, v) + lal’llvl[? 


for alla € F. Thus 
—2Rea(u,v) < ļaļ?lloll? 


for all a € F. In particular, we can let a equal —t(u, v} for t > 0. Substitut- 
ing this value for a into the inequality above gives 


2t|(u, v)? < Plu, v)? 


for all £ > 0. Divide both sides of the inequality above by t, getting 
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2\(u,v)|? < el(u, v) P oll? 


for allt > 0. If v = 0, then (u,v) = 0, as desired. If v Æ 0, set t equal to 
1/||v||? in the inequality above, getting 


2|(u,v)|? < (u, v)’, 
which implies that (u, v} = 0, as desired. 


3. Prove that 


EENE 


j= j= 
for all real numbers a1, ..., an and b,..., bn. 
SOLUTION: Suppose a1,...,an,b],.--,bn € R. Using the usual inner 


product on R”, we have 
(Eat) = (EVA 
j=1 j=1 
= ((a1, V2a2, tee Vnan), (bi, b2/ V2, euti bn/Vn))” 
< [I(a1, V2a9,..., Vran )I lbr, b2/V2,..-,bn/ Vn)? 
= (Lia?) (2873) 
j=l j=1 


where the inequality above comes from the Cauchy-Schwarz inequality. 


4. Suppose u,v € V are such that 
lull =3, |lut+vl=4, |ju—vll =6. 
What number must ||| equal? 
SOLUTION: From the parallelogram equality, we have 
joj? = Wet vl? + le — ol? — 21u? 
_ 16 +36- 18 


2 
= 17. 
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Thus |{v|| = V17. 
5. Prove or disprove: there is an inner product on R? such that the associated 


norm is given by 
[|(x1, 2) |] = [xa] + |z2l 
for all (21,22) € R?. 


SOLUTION: We will show that there does not exist an inner product on 
R? such that the associated norm is given by the formula above by showing 
that the parallelogram equality is violated. Let 


u = (3,2) and v= (1,3). 

Then 
u+ v= (4,5) and u-v = (2, -—1). 

Using the formula above, we then have 

lu + vll? + lju — vi? = 81 +9 

= 90 

and 

2(lluli? + llull?) = 2(25 + 16) 

= 82. 
“Thus the parallelogram equality fails, as desired. 
6. Prove that if V is a real inner-product space, then 


_ lu+ol?- llu - vl? 


(u,v) : 


for all u,v € V. 


SOLUTION: Suppose V is a real inner-product space and u,v € V. Then 


lu + oll? - llu - vll? _ (u +v,u +v) —(u-v,u—) 


4 4 
la + 24u, 0) + [lvll? — (lul? — 2¢u, 0) + lol) 
4 
_ 4(u, 0) 
4 


= (u, v), 
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as desired. 
T: Prove that if V is a complex inner-product space, then 


lu + vll? — lju — vll? + llu + ivll’i — Ilu — to ||? 


(u, v) =, 4 


for all u,v € V. 


SOLUTION: Suppose V is a complex inner-product space and u,v € V. 


Then 
llu + v||? =(utv,utv) 
= |lull? + (u,v) + (v, u) + [lol]? 
and 
-llu — vl? = —-(u — v, u — v) 
= —|lull? + (u, v) + (v, u) — lloll? 
and 
illu + ivl]? = i(u + iv, u + iv) 
= illul|? + (u,v) — (v, u) + illvl|? 
and 


—i||u — iv||? = —i(u — iv, u — iv) 


= illul? + (u,v) — (v, u) — illli? 
Adding the four equations, we have 
lu + vl? — llu — vl? + ilu + ivl? — illu — ivl]? = 4(u, v), 
as desired. 


8. A norm on a vector space U is a function || ||: U — [0, 00) such that |u|] = 0 
if and only if u = 0, |lau|| = jajllul| for all a € F and all u € U, and 
lu + vll < llull + [lvl] for all u,v € U. Prove that a norm satisfying the 
parallelogram equality comes from an inner product (in other words, show 
that if || || is a norm on U satisfying the parallelogram equality, then there 
is an inner product ( , ) on U such that ||ul| = (u, u)}!/? for all u € U). 
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COMMENT: This is among the hardest exercises in the book. Instructors 
may want to simplify this exercise slightly by allowing students to consider 
only the case where F = R. 


SOLUTION: Suppose that U is a vector space and || || is a norm on U 
satisfying the parallelogram equality. We want to find an inner product ( , ) 
on U such that |u|] = (u, u)!/? for all u € U. 

First consider the case where F = R. For ‘u,v € U, define (u,v) by 


lu + ull? — |lu — vll? 


(u, v) = 4 


This definition is motivated by Exercise 6 of this chapter, which gives a 
formula for the inner product in terms of norms. 
For u € U we have 


_ lutul - Ilu - ul? 


(uu) - 
_ l2ulÊ — 01? 
4 
= |lull?. 
Thus ||ul| = (u,u)!/?, as desired. However, we still must show that ( , ) 


satisfies the properties required of an inner product. 

Because (u, u) = ||u]|? (as shown above), we have (u, u) > 0 forallu € U, 
with equality if and only if u = 0; these properties follow from the properties 
of a norm. Thus ( , ) satisfies the positivity and definiteness properties 


required of an inner product. 
To prove that ( , } is additive in the first slot, let u, v, w € U. Then 


4((u +v, w) — (u, w) — (v, w)) 
= |ju +v + wl? — llu + v - wll? — |lu + wll? 
+ |lu — wl? — llv + wll? + [lv — wl? 
= jju +v + wll? + (lu — wll? + llv — wll?) 
— llu + v - w|? - (llu + wll? + llv + wll), 


where the first equality comes from the definition of ( , ). In the last 
equality above, the parentheses indicate groupings to which we will apply 
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the parallelogram equality, which asserts that the sum of the norms squared 
of two vectors x,y € U can be computed from the formula 


lz + yll? lz- yl? 


2 2s 
ll? + ll? = = > 


Applying the parallelogram equality to the two terms in parentheses above 
(take r = u — w,y = v — w for the first sum in parentheses, then z = 
u +w,y = v + w for the second term in parentheses) gives 


A((u + v, w) — (u, w) — (v, w)) 


v —2w||? — v||2 
Sjan ea leew 
oe lu+v+2wl? _ llu- vll? 
lu +v- wll 5 a 


llu +v — 2w||? 

2 
lu +v + 2w]||? 
= ge 


= (llu + v + wll? + lwli?) + 
— (llu +v — wll? + lwll?) - 


Applying the parallelogram equality to the two terms in parentheses above 
(take t = u +v + w,y = w for the first sum in parentheses, then z = 
u+vu—w,y = w for the second term in parentheses) gives 


4((u +v, w) — (u, w) — (v, w)) 


+v + 2w|? +l? llu +v- ?2wlļ? 
_ llu +v +2wl| le v|| g v — 2w|| 


2 2 2 
luto? _ lu+v-2wl? llu +v+2wl? 
2 2 2 


Thus 
(u +v, w) = (u, w) + (v, w), 


completing the proof that ( , ) is additive in the first slot. 
To prove that ( , ) is homogeneous in the first slot, let u,v € U. If n is 
a positive integer, then 
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(nu, v) = (u+---+4,v) 


n times 


= (uv) +-+ (uo) 


n times 
= n(u, v), 


where the second equality comes from additivity in the first slot, which we 
have already verified. Replacing u with u/n in the equality above gives 
(u,v) = n(u/n,v), which implies that 


(Fv) = 2 (uv). 


Let m be another positive integer, and replace u with mu in the equality 
above, getting 


(u,v) = Š (muo) 


> Tuv), 


where the second equality holds because we have already shown that (, } 
is homogeneous in the first slot with respect to positive integers. We have 
now shown that (, ) is homogencous in the first slot with respect to positive 
rational numbers. 

From the definition of ( , ), we have 


_ [leu + ol)? = i-u - ol? 
(uu) = H 


_ _lu+ol?- lu- ol? 
“4 
= —{u,v). 


Combining this with the result from the previous paragraph, we can now 
conclude that {, ) is homogeneous in the first slot with respect to all rational 
numbers. 

Now suppose that A € R. There exists a sequence rj,r2,... of rational 
numbers such that lim,_.65 Tn = A. Thus 
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A(u,v) = lim rp(u, v) 
n= 
= lim (rau, v) 
nw 
mn autol? — rau — vl 
= lin A. 
n=+00 4 
In the next paragraph we will show that limpio ||Tnu + ul] = ||Au + vl] and 
limno |[rnu — v|| = ||Au— v||. Combining this with the last equation above 


we can conclude that 


làu + vl? — [làu — vll? 


Alu, v) = 7 


= (Au, v}, 


which will complete the proof that ( , } is homogeneous in the first slot. 
If z,y € U, then 


lzi = lly + (£ — y)ll 
< |lyll + lz — yll 


and thus 

llzll — Ilyll < lz — yll- 
Interchanging the roles of z and y, we get 

lull - Well < Illz — yll- 


Because |||æ|| — llyll| equals |[x{| — [lyl| or llyll — llæll, we can now conclude 
that 


Hizli- lylll < læ — yll- 
With z = rau +v and y = Au + v, this inequality gives 


|lirau + vll — [Au + vll] < raw — Aull 
= [rn — Alllull. 


Because limp_.o9 Tn = A, this shows that 
lim |[rau + vj] = ||Au + vll. 
n—-+0o 


Replacing v with —v, we have 
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lim ||rpu — v|| = ||Au — vll. 
n= 


The last two equations are the promised ingredients that were needed for 
the proof that ( , ) is homogeneous in the first slot. 
Finally, we must show that (u,v) = (v, u) (recall that we are considering 
the case where F = R). This last step is easy: 
lu + vll? — luo]? 


(u, v) = 4 
_ lv +- lv - ul? 
4 
= (v,u). 
This completes the proof that ( , } is an inner product when F = R. Whew! 
Now consider the case where F = C. For u,v € U, define (u, v) by 
fa, o) = Wet vl? = liu — vl? + llu + ioli — [lu — olli 
4 
This definition is motivated by Exercise 7 of this chapter, which gives a 
formula for the inner product in terms of norms. 
For u € U we have 
= lu + ul]? — [lu — ull? + llu + iul ’i — llu — eel] 
ie 4 
_ Wu]? + [1+ af? lleal|?é — [1 — illul? 
E 4 
_ 4llull? + 2llull?i — 2llull?i 
R 4 


(u,u 


= |lull?. 


Thus |lu|| = (u,u)!/?, as desired. However, we still must show that (, ) 
satisfies the properties required of an inner product. 

Because (u, u) = ||u||? (as shown above), we have (u, u) > 0 for all u € U, 
with equality if and only if u = 0; these properties follow from the properties 
of a norm. Thus ( , ) satisfies the positivity and definiteness properties 
required of an inner product. 

For convenience, let’s define ( , )R by 


_ let ol? = llu vl? 


(u, v)R 4 
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Here the subscript R reminds us that ( , }r, was the inner product we defined 
when considering the case F = R. Now we are assuming that F = C, but 
(, )r is still well defined. Note that 


(u,v) = (u, v)r + (u, iv) Ri. 


We have already proved that ( , }r is additive in the first slot, and now 
we use that information. Let u,v,w € U. Then 


(u+v,w) = (u +v, wr + (ut+v,iw)Ri 
= (u, w)r + (v, w)r + (u, iw)Ri + (v, iw) Ri 
= ((u,w)r + (u, iw)ri) + ((v, w)R + (v, iw) Ri) 
= (u, w) + (v, w). 


Thus ( , ) is additive in the first slot. 
To prove that ( , ) is homogeneous in the first slot, let u,v E U. AER, 
then 


(Au, v) = (Au, vìr + (Au, iv) Ri 
= Xu, vr + A(u, iv) Ri 
= Au, v), 


where we have used the homogeneity of { , r in the first slot: The last 

equation above shows that ( , ) is homogeneous in the first slot with respect 

to all real numbers. We must still extend this result to complex numbers. 
Note that 


lliu + vl]? — [liu — vll? + lliu + ivl|?i — lliu — ivll?i 
4 
zs lilu + v)|[?é — lilu — v)|l?é — lilu + iv) ||? + lilu — iv) ||? 
4 
lu + vll?é — llu — vlii — llu + oll? + llu — ivl]? 
4 


(iu, v) = 


= i(u,v). 


Combining this result with additivity and homogeneity with respect to real 
numbers, we get that 


((a + bi)u, v) = (a + bi){u, v) 
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for all a,b € R. In other words, ( , } is homogeneous in the first slot with 
respect to all complex numbers. 
Finally, we must show that (u,v) = (v,u). This last step is easy: 
nue llu + vll? — llu — vll? + |lu + ivll?i — [lu — żvll?i 
4 
_ le +a? = llv — ull? + lli(—~iu +o) — [|(—2) (u + v)? 
= ee ee ee ee 


_ llv + ull? — lu — ull? + llv + cull? — [lv — tulle 
— = ge, aa 


= (v,u). 
This completes the proof that ( , ) is an inner product when F = C. 
9. Suppose n is a positive integer. Prove that 
Ca oe a 


is an orthonormal list of vectors in C[—7, r], the vector space of continuous 
real-valued functions on [—z, 7] with inner product 


T 
(fa) = f Fla)a(e) ae. 
COMMENT: This orthonormal list is often used for modeling periodic 
phenomena such as tides. 


SOLUTION: First we need to show that each element of the list above 
has norm 1. This follows easily from the following formulas: 


_2 jt — sin 2ft 
J (sin jt)? dt ae a 


' 27 + sin 2jt 
t)? dt = H, 
J (cosst) i 
Next we need to show that any two distinct elements of the list above 
are orthogonal. This follows easily from the following formulas, valid when 


j#k: 
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J (sin jt)(sin kt) dt = 
jsin(j — k)t + ksin(j — kt — j sin(j + k)t + ksin(j + k)t 
2(j — k)(j + k) 
/ (sin jt)(cos kt) dt = 


jcos(j — k)t + kcos(j — k)t + j cos(j + k)t — kcos(j + k)t 
2(k — j) + k) 


J (cos jt)(cos kt) dt = 
jsin(j — k)t + ksin(j — k)t + jsin(j + k)t — ksin(j + k)t 
2(7 — k)(j + k) 
«4\2 
J (in 5e)(c05 4) dt = oe 


On P2(R), consider the inner product given by 


l 
(p,q) = [ p(x)q(2) de. 


Apply the Gram-Schmidt procedure to the basis (1,z,27) to produce an 
orthonormal basis of P2(R). 


SoLuTION: Applying the Gram-Schmidt procedure to (1, x, £?) produces 
(using elementary calculus and some arithmetic) the following orthonormal 
basis of P2(R): 


(1, V3(—1 + 22), V5(1 — 62 + 62”). 


What happens if the Gram-Schmidt procedure is applied to a list of vectors 
that is not linearly independent? 


SOLUTION: Suppose (v},...,Um) is a linearly dependent list of vectors 
in V. 

If vı = 0, then at the first step of applying the Gram-Schmidt procedure 
to (v1,...,Um) we will be dividing by 0 when trying to set e; = v;/||v;||. 

If vı # 0, then by the linear dependence lemma (2.4), some 2; is in 
span(v),...,0j-1); here we choose j to be the smallest positive integer 
with this property. If we apply the Gram-Schmidt procedure to produce 
(e1,...,ej;-1) at the end of step j, then 


span(v;,...,vj-1) = span(e;,...,e;-1). 
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12. 


Thus v; € span(e},...,e;-1). By 6.17, this implies that 
vj = (vj, eijer +++ + (vj, ej-1)ej-1- 


Thus the Gram-Schmidt formula 6.23 for e; includes a division by 0, which 
is not allowed. 


Suppose V is a real inner-product space and (v,..., vm) is a linearly inde- 
pendent list of vectors in V. Prove that there exist exactly 2” orthonormal 
lists (€1,...,€m) of vectors in V such that 


span(v),...,v;) = span(e1,...,e;) 
for all j € {1,...,m}. 
SOLUTION: For j = 1, the condition above states that 
span(v;) = span(e;). 


Because there are only two vectors in span(v,) with norm 1 (these two 
vectors are v;/||v1{| and —v;/||v;||), we have only these two choices for ey. 

Now suppose that j > 1 and that an orthonormal list (e1,...,e;-1) has 
been chosen such that 


span(vj,...,vj-1) = span(e1,...,€;-1). 


The Gram-Schmidt procedure produces e; € V such that (ej,..., ej) is an 
orthonormal list and 


span(v),...,vj) = span(ei,...,e;). 


Suppose e;’ € V is another vector with these properties, meaning that 
(e1, .--,€j-1,€3') is an orthonormal list and 


span(v,...,0;) = span(e1,...,ej-1, e;’). 


The last two equations show that span(e},...,¢j;-1,e;’) = span(e1,...,e;). 
In particular, e;’ € span(e1,...,e;), which implies that 
e;' = (e;’, eije: t-t (e;', e;)e; 
= (ej',ej)ej, 


where the first equality comes from 6.17 (with span(e1,...,e;) replacing V 
and j replacing n) and the second equality holds because (e1,... €j-1,€;') 
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is an orthonormal list. Taking norms of both sides of the last equation, and 
recalling that e; and e;’ both have norm 1, we see that |(e;’,e;)| = 1. Thus 
(e;’,e;) = 1 or (e;’,e;) = —1. Hence the last equation above implies that 
ej' =e; or ej' = —e;. 

We have shown that there are exactly two possible choices for each €j. 
As j ranges from 1 to m, this gives us exactly 2™ possible choices for 


(e1,.-.,€m). 
13. Suppose (€1,...,€m) is an orthonormal list of vectors in V. Let v € V. 
Prove that 
lol? = [{v, e1)[? +--+ + [(v, em)? 
if and only if v € span(e),.-..,ém). 


SOLUTION: Extend (e1,...,@m) to an orthonormal basis (e1,..., €n) 
of V. Then 


v = (v,e1)e1 +--- + (V, Enjen 
and 
loll? = (v, e1)? +--+ 1(, en) [5 
see 6.17. From the last equation, we see that 
loll? = (v, e1)? +--+ + (v, em)? 


if and only if (v, em4+1) = --- = (Vv, €n) = 0. From the first equation above, 
this happens if and only if 


v= {v, €1)€1 +---+ (v, Em)em, 
which happens if and only if v € span(e1,. sy êm): 


14. Find an orthonormal basis of P2(R.) (with inner product as in Exercise 10) 
such that the differentiation operator (the operator that takes p to p’) 
on P2(R) has an upper-triangular matrix with respect to this basis. 


SOLUTION: Because 1’ = 0, x’ = 1, and (x?) = 2z, the differentiation 
operator on P2(R) has an upper-triangular matrix with respect to the basis 
(1,2,27). However, (1,z,2”) is not an orthonormal basis. But, as can be 
seen from the proof of 6.27, if the Gram-Schmidt procedure is applied to this 
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15. 


16. 


17. 


basis, we will get an orthonormal basis with respect to which the differenti- 
ation operator has an upper-triangular matrix. As we saw in Exercise 10 of 
this chapter, the Gram-Schmidt procedure applied to (1,2, z?) gives 


(1, V3(—1 + 2z), V5(1 — 6x + 6z°)). 
which is our desired orthonormal basis of P2(R). 
Suppose U is a subspace of V. Prove that | 

dim U+ = dim V — dim U. 
SOLUTION: From 6.29, we know that 
V=U UŁ. 
Thus by Exercise 17 in Chapter 2, we have 
dim V = dim U + dim U+, 
which implies that dim UŁ = dim V — dim U. 
Suppose U is a subspace of V. Prove that U+ = {0} if and only if U =V. 
SOLUTION: From 6.29, we know that 
V=U U+. 
This clearly implies that U4 = {0} if and only if U = V. 


Prove that if P € L(V) is such that P? = P and every vector in null P is 
orthogonal to every vector in range P, then P is an orthogonal projection. 


SOLUTION: Suppose P € L(V) is such that P? = P and every vector 
in null P is orthogonal to every vector in range P. Let U = range P. We 
will show that P equals the orthogonal projection Py. To do this, suppose 
v EV. Then 


v = Pv + (v — Pv). 


Clearly Pu € range P = U. Also, P(v — Pv) = Pv — P*v = 0, which means 
that v — Pv € null P. Thus v — Pv is orthogonal to every vector in U. In 
other words, v — Pu € U+. Thus the equation above writes v as the sum of 
a vector in U and a vector in U+. In this decomposition, the vector in U 
equals, by definition, Pyv. Hence Pu = Pyv, as desired. 
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Prove that if P € L(V) is such that P? = P and 
Pull < llel 
for every v € V, then P is an orthogonal projection. 


SOLUTION: Suppose u € range P and w € null P. If we can show that 
(u, w) = 0, then by the previous exercise we can conclude that P is an 
orthogonal projection. 

Because u € range P, there exists u’ € V such that 


u = Pu’. 
Applying P to both sides of this equation, we have 
Pu = P*y! 
= Pu! 


=u. 
Because w € null P, this implies that 
P(u+aw)=u 
for every a € F. Thus 
lul? = || P(u + aw)||? 
< |lu + awli? 


for every a € F, where the second line follows from our hypothesis that 
|| Pol] < |lv|] for every v € V. The inequality above implies (see Exercise 2 
of this chapter) that (u, w) = 0, as desired. 


Suppose T € L(V) and U is a subspace’of V. Prove that U is invariant 
under T if and only if PyT Py = T Py. 


SOLUTION: First suppose that U is invariant under T. Let v € V. 
Then Pyv € U and hence T(Pyv) € U (because U is invariant under T). 
Thus Py(T(Pyv)) = T(Puv). Because v was an arbitrary vector in V, this 
implies that PyT Py = T Py, as desired. 

To prove the implication in the other direction, now suppose that 


PuT Py =TPy. 
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Suppose u € U. Then Pyu = u, so applying both sides of the equation 
above to u gives Py(Tu) = Tu, which implies that Tu € U. Because u was 
an arbitrary vector in U, this implies that T is invariant under U, as desired. 


Suppose T € L(V) and U is a subspace of V. Prove that U and UŁ are 
both invariant under T if and only if PyT = TPy. 


SOLUTION: First suppose that U and UŁ are both invariant under T. 
By the previous exercise, this implies that 


PyT Py =TPy 
and 
PystTPys = T Py. 
But Pys = I — Py, so the last equation becomes 
(I - Py)TU - Pu) = TU — Py). 
Expanding both sides of the equation above and rearranging terms, we get 
PyuT Py = PyT. 


Combining this with the first equation above, we get PyT = T Py, as desired. 
To prove the implication in the other direction, suppose now that 


Pol =T Py: 
Then 
PyT Py = (PyT)Py 
= (TPy)Py 
= TPg? 
=TPy, 


which implies (by the previous exercise) that U is invariant under T, as 
desired. Also, 
PysTPys = ((I — Pu)T) Pya 

= (T — PyT)Pyt 

=(T-TPy) Py. 

=T(1 — Py) Py. 

=TPy.s? 

=TPyu, 
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which implies (by the previous exercise) that U+ is invariant under T, as 
desired. 


In R4, let 
U = span((1, 1,0,0), (1, 1, 1,2)). 
Find u € U such that ||u — (1, 2,3, 4)|| is as small as possible. 


SOLUTION: First we find an orthonormal basis of U by applying the 
Gram-Schmidt procedure to ((, 1,0,0), (1, 1,1, 2)), getting 


€e = 


(7) 


1 2 
e2 = (0,0, —, =). 
5 ( ws z) 
Thus with e],€2 as above, (e1,€2) is an orthonormal basis of U. By 6.36 
and 6.35, the closest point u € U to (1,2,3, 4) is 


((1, 2, 3, 4), e1)e1 + ((1, 2, 3, 4), €2)€2, 
which equals 
(3,1,2) 
27325: 5 
Find p € P3(R.) such that p(0) = 0, p'(0) = 0, and 
1 
J 2 + 32 — pla) |? dz 
0 
is as small as possible. 


SOLUTION: Define an inner product on P3(R) by 


1 
(fo) = | Hegla) ae. 
Let q(x) = 2 + 3z, and let 


U = {p € P3(R) : p(0) = 0, p'(0) = 0}. 


With this notation, our problem is to find the closest point pEU tog. To 
do this, first we find an orthonormal basis of U. 
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A polynomial p satisfying p(0) = 0, p'(0) = 0 has constant term 0 and 
first degree term also equal to 0. Thus a basis of U is 
(x?, z’). 
Apply the Gram-Schmidt procedure to this basis, getting 
ey = V522 
€2 = V7(-52? + 62°). 
Thus with e,,e2 as above, (e1,€2) is an orthonormal basis of U. By 6.36 
and 6.35, the closest point p € U to q is given by the formula 
p = (q,e1)e1 + (q, €2)e2. 
A short computation now shows that 
p(x) = 242? — migs, 


Find p € P5(R) that makes 
kig 
/ Ising — p(x)? dz 
-r 


as small as possible. (The polynomial 6.40 is an excellent approximation 
to the answer to this exercise, but here you are asked to find the exact 
solution, which involves powers of x. A computer that can perform symbolic 
integration will be useful.) 


SOLUTION: Let C[—7, m] denote the real vector space of continuous real- 
valued functions on [—7, 2] with inner product 


Wo) = [7 sle)ole) ae. 


Let v € C[—1, z] be the function defined by v(x) = sinz. Let U denote the 
subspace of C[—7, m] consisting of the polynomials with real coefficients and 
degree at most 5. We need to find p E U such that ||v — p|| is as small as 
possible. 

First find an orthonormal basis of U by applying the Gram-Schmidt 
procedure (using the inner product above) to the basis (1, £, £?, £3, x4, x5) 
of U, producing the orthonormal basis (e1, €2, €3, €4, €5, €e), Where 
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1 
a= Von’ 
e2 = mo’ 
VEG — 3x?) 
BRaT Sa 
(3802 — 523) 
= 
eee 3(3x4 — 30022? + 3524) 
i 8209/2 ’ 
y H (15442 — 70r?r? + 6325) 
Sen a 


Now compute Pyv using 6.35 (with m = 6), getting 


105(1485 — 1537? + 7t)  315(1155 — 1257? +24) 3 
Pou ei a a e e a ia 


$ 693(945 — 1057r? + wi) s. 
8r10 


Finally, 6.36 and the discussion following 6.42 show that the function above 
is the one we seek. 


Find a polynomial q € P2(R) such that 


1 
25) = | p(a)a() dx 
for every p € P2(R). s 


SOLUTION: We will need an orthonormal basis of P2(R.), where the inner 
product of two polynomials in P2(R) is defined to be the integral from 0 to 
1 of the product of the two polynomials. An orthonormal! basis of P2(R) 
was already computed in Exercise 10 of this chapter. Specifically, let 


e(z) =1 
eo(x) = V3(—1 + 22) 
e3(x) = V5(1 — 6x + 62”). 
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26. 


Then (e1, €2,e3) is an orthonormal basis of P2(R). 
Define a linear functional y on P2(R) by 


(p) = (5). 


We seek q € Po(R) such that y(p) = (p,q) for every p € Po(R). By the 
formula given in the proof of 6.45, we have 


q = pleri)er + p(e2)e2 + p(e3)es. 


Evaluate the right side of the equation above to get 
3 2 
q(x) = Sat 15z — 152°. 


Find a polynomial g € P2(R) such that 


1 1 
f p(z)(cosrz) dx = J p(x)q(x) dx 
0 0 


for every p € Po(R). 
SOLUTION: Define a linear functional y on P2(R) by 


1 
o(p) = f p(z)(cos m2) de. 


We seek q € P2(R) such that y(p) = (p,q) for every p € Po(R), where 
the inner product on P2(R) is defined as in the previous exercise. Letting 
€1,€2,€3 be as in the previous exercise, but using our new definition of ¢, 
we again have 


q = pler)er + p(ez)ez + v(es)es. 
Evaluate the right side of the equation above to get 


_ 12-242 
ae 


q(x) 


Fix a vector v € V and define T € £(V,F) by Tu = (u,v). For a € FE, find 
a formula for T*a. 


SOLUTION: Because T € L(V,F), we know that T* € CL(F,V). Fix 
a € F. Then T*a is the unique vector in V such that 
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(a) (Tu, a) = (u, T*a) 


for all u € U. The inner product on the right is the inner product in V, but 
the inner product on the left is the usual inner product on F: the product 
of the entry in the first slot with the complex conjugate of the entry in the 
second slot. Thus 


(Tu, a) = (Tu)a- 
= (u, v)a 
(b) = (u, av). 


Comparing (a) with (b) gives 
(u, T*a) = (u, av) 
for all u € U. Thus T*a = av. 
27. Suppose n is a positive integer. Define T € L(F") by 
T (23; 64S) = (0, 21,- <., Zn-1)- 
Find a formula for T*(z1,... , Zn). 


SOLUTION: Fix (z1,..., 2n) € F”. Then for every (w1, ..., Wn) € E”, we 
have 


((wi, -<3 Wn), T* (213. -+3 Zn)) = (T(wi,..-, Wn), (213-3 2n)) 
= ((0, wi, - os ,Wn—1), (21,--- Zn) 
= W122 +--+- + Wn—iZn 


((wi,-.-,; Wn), (225. -- , Zn, 0)}- 


Thus 
T* (21, -<3 Zn) = (22, - -- , Zn, 0)- 


28. Suppose T € L(V) and à € F. Prove that A is an eigenvalue of T if and 
only if À is an eigenvalue of T*. 


SOLUTION: We have 
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à is an not eigenvalue of T <=> T — XI is invertible 

<=> S(T -AD =(T-ANS=!I 
for some S € L(V) 

&> (T- ANS = S(T -AD =I 
for some S € L(V) 

<=> (T — XI)* is invertible 

<=> T* — XI is invertible 

<> i is not an eigenvalue of T*. 


Thus A is an eigenvalue of T if and only if À is an eigenvalue of T*. 


29. Suppose T € L(V) and U is a subspace of V. Prove that U is invariant 
under T if and only if UŁ is invariant under T*. 


SOLUTION: First suppose that U is invariant under T. To prove that 
U+ is invariant under T*, let v € UŁ. We need to show that T*v € U+. 
But 


for every u € U (because if u € U, then Tu € U and hence Tu is orthogonal 
to v, an element of Ut). Thus T*v € Ut, and hence U+ is invariant under 
T*, as desired. i 

To prove the other direction, now suppose that U+ is invariant under T*. 
Then by the first direction, we know that (U+)+ is invariant under (T*)*. 
But (U+)+ = U (by 6.33) and (T*)* = T, so U is invariant under T, 
completing the proof. l : 


30. Suppose T € L(V,W). Prove that 
(a) T is injective if and only if T* is surjective; 
(b) T is surjective if and only if T* is injective. 
SOLUTION: First we prove (a): 
T is injective <= nullT = {0} 
<=> (rangeT*)+ = {0} 


<=> range T* = W 
<= T" is surjective , 
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31. 


32. 


where the second line comes from 6.46(c). 
Now that (a) has been proved, (b) follows immediately by replacing T 
with T* in (a). 


Prove that 
dim null T* = dim null T + dim W — dim V 
and | 
dim range T* = dim range T 
for every T € L(V, W). 
SOLUTION: Let T € L(V,W). Then 
dim null T* = dim(range T)+ 


= dim W — dim range T 
= dim null T + dim W — dim V, 


where the first equality comes from 6.46(a), the second equality comes from 
Exercise 15 of this chapter, and the third equality comes from 3.4. This 
proves the first equality that we seek. 

To prove the second equality, note that 


dim range T* = dim W — dim null T* 
= dim V — dim nul T 
= dim range T, 


where the first and third equalities come from 3.4 and the second equality 
comes from the first part of this exercise. This proves the second equality 
that we seek. ; 


Suppose A is an m-by-n matrix of real numbers. Prove that the dimension 
of the span of the columns of A (in R™) equals the dimension of the span 
of the rows of A (in R”). 


SOLUTION: Let T € £(R",R™) be such that the matrix of T (with 
respect to the standard bases) equals A. Then rangeT equals the span of 
the columns of A. Thus 
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dimension of the span of the columns of A 
= dimrangeT 
= dim range T* 
= dimension of the span of the columns of M(T*) 
= dimension of the span of the columns of the transpose of A 
= dimension of the span of the rows of A, 


where the second equality comes from the previous exercise and the fourth 
equality comes from 6.47. 


CHAPTER 7 


Operators on | 
Inner-Product Spaces 


Make P2(R) into an inner-product space by defining 


1 
(p,a) = f p(z)a(2) dz. 


Define T € L(P2(R)) by T(ao + az + azz?) = az. 
(a) Show that T is not self-adjoint. 


(b) The matrix of T with respect to the basis (1, x, z?) is 


000 
01 0]. 
000 


This matrix equals its conjugate transpose, even though T is not self- 
adjoint. Explain why this is not a contradiction. 


SOLUTION: (a): Note that 


(T1, x) = (0, z) 
=0 
but 
(1, Tz) = (1,1) 
A 1 
2 


15 
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Thus (T1, x) Æ (1, Tx), which shows that T is not self-adjoint. 

(b): The result stating that the matrix of T* is the conjugate transpose 
of the matrix of T has as a hypothesis that we are working with orthonormal 
bases (see 6.47). Because (1, x, x”) is not an orthonormal basis of P2(R), we 
cannot compute the matrix of T* with respect to this basis by taking the 
conjugate transpose of the matrix of T. 


2. Prove or give a counterexample: the product of any two self-adjoint opera- 
tors on a finite-dimensional inner-product space is self-adjoint. 


SOLUTION: Let S,T € L(F?) be the operators whose matrices (with 
respect to the standard basis) are given by 


1 0 


M(s) =| 4 2 


| and m=? als 


Each of these matrices obviously equals its conjugate transpose, and hence 
S,T are self-adjoint. Now 


MIST) = Mis\m(r)=|9 4 | 


Because M(ST) does not equal its conjugate transpose, ST is not self- 
adjoint. Thus we have an example of two self-adjoint operators whose prod- 
uct is not self-adjoint. 

Of course there are also many other examples. 


COMMENT: Suppose S,T € L(V) are self-adjoint. Then ST is self- 
adjoint if and only if ST = T'S (as is easy to see). 
3. (a) Show that if V is a real inner-product space, then the set of self-adjoint 


operators on V is a subspace of L(V). 


(b) Show that if V is a complex inner-product space, then the set of self- 
adjoint operators on V is not a subspace of L(V). 


SOLUTION: (a): Suppose V is a real inner-product space. Obviously the 
zero operator is self-adjoint. Furthermore, if S,T € L(V) are self-adjoint, 
then 


(S+T) =S*4+T" 
=S+T, 
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and thus S+T is self-adjoint. Finally, if T € £(V) is self-adjoint anda € R, 
then 


(aT)* = aT" 
= aT, 


and thus aT is self-adjoint. We have shown that the set of self-adjoint 
operators on V contains the zero operator and that it is closed under addition 
and scalar multiplication. Thus the set of self-adjoint operators on V is a 
subspace of L(V). 

(b): Suppose now that V is a complex vector space. The identity operator 
I is self-adjoint, but (iJ)* = —iI so il is not self-adjoint. Thus the set of 
self-adjoint operators on V is not closed under scalar multiplication and 
hence it is not a subspace of L(V). 


4. Suppose P € L(V) is such that P? = P. Prove that P is an orthogonal 
projection if and only if P is self-adjoint. 


SOLUTION: First suppose that P is an orthogonal projection. Thus there 
is a subspace U of V such that P = Py. Suppose v1, v2 € V. Write 
vi =u, +w, v2 = u + w, 


where u1, ug € U and w1, we € UL (see 6.29). Now 


(Pv, v2) = (u1, u2 + we) 
= (ui, u2) + (ui, we) 
= (u1, u2) 
= (u, U2) + (w1, Ue) 
= (uy + wi, U2) 
= (v, Pug). 


Thus P = P*, and hence P is self-adjoint. 
To prove the implication in the other direction, now suppose that P is 
self-adjoint. Let v € V. Because P(v — Pv) = Pv — P?v = 0, we have 


v — Pv € null P = (range P*)+ = (range P)+, 
where the first equality comes from 6.46(c). Writing 


v= Pv + (v — Pv), 
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we have Pv € range P and (v — Pv) € (range P)+. Thus Pv = Prange PU. 
Because this holds for all v € V, we have P = Prange P, Which shows that P 
is an orthogonal projection. 


5. Show that if dimV > 2, then the set of normal operators on V is not a 
subspace of L(V). 


SOLUTION: Suppose dim V > 2. Let (e1,...,en) be an orthonormal 
basis of V. Define S,T € L(V) by 


S(aye1 + --- + anen) = a2€1 — a1€2 
and 
T(aiei ++- + anen) = age; + a162. 
A simple calculation verifies that 
S* (aiei +-+- + anen) = —ae] + a162. 


From this formula, another simple calculation shows that SS* = S*S. Yet 
another simple calculation shows that T is self-adjoint. Thus both S and T 
are normal. However, S +T is given by the formula 


(S +T)(aye1 + -++ + anen) = 2a2€1. 
A simple calculation verifies that 
(S + T)*(aye; +++: + anen) = 2a1e2. 


A final simple calculation shows that (S +T)(S +T)* 4 (S+T7)*(S+T). 
In other words, S +T is not normal. Thus the set of normal operators on 
V is not closed under addition and hence is not a subspace of L(V). 


6. Prove that if T € L(V) is normal, then 
range T = range T*. 
SOLUTION: Suppose T is normal. Then 
rangeT = (nullT*)+ 


= (null T)+ 
= range T*, 
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where the first equality comes from 6.46(d), the second equality comes from 
7.6 (see especially the marginal comment at 7.6), and the third equality 
comes from 6.46(b). 


7. Prove that if T € £(V) is normal, then 
null TF = nullT and range TF = rangeT 


for every positive integer k. 


SOLUTION: Suppose T € L(V) is normal and that k is a positive integer. 
Obviously we can assume that k > 2. 
First we will prove that null T* = null T. If v € nullT, then 


and so v € null TF. Thus nullT C null T*. 
To prove an inclusion in the other direction, suppose now that v € null T*. 
Then 


(T*T*1y, T*T 1p) = (TT*T ly, TH 0) 
= (T*T*v, Tv) 


where the second equality holds because T*T = TT*. The last equality 
above implies that T*T*—!y = 0. Thus 


0 = (T*T*- tv, T*-2v) 
= (Tey, Tp). 


Hence T*—!y = 0. In other words, v € null T*-}. The same argument, with 
k replaced with k — 1, shows that v € null T*-?. Repeat this process until 
reaching the conclusion that v € nullT. This shows that null TF C nullT, 
completing that proof that null T* = null T.. 

Now we will show that rangeT* = rangeT. If v € range T*, then there 
exists u € V such that v = TFu = T(T*—!)u, which implies that v € range T. 
Thus range TF C rangeT. Note that 


80 CHAPTER 7. Operators on Inner-Product Spaces 


10. 


dim range T* = dim V — dim null T* 
= dim V — dim null T 
= dimrangeT, 


where the first and third equalities come from 3.4 and the second equality 
comes from the first part of this exercise. Because range T* and range T 
have the same dimension and one of them is contained in the other, these 
two subspaces of V must be equal, completing the proof. 


Prove that there does not exist a self-adjoint operator T € L(R3) such that 
T(1, 2,3) = (0,0,0) and T(2, 5,7) = (2,5, 7). 


SOLUTION: Suppose T € £(R4) is such that T(1,2,3) = (0,0,0) and 
T(2,5,7) = (2,5,7). Obviously (1,2,3) is an eigenvector of T with eigen- 
value 0 and (2,5,7) is an eigenvector of T with eigenvalue 1. If T were 
self-adjoint, then eigenvectors corresponding to distinct eigenvalues would 
be orthogonal (see 7.8). Because (1,2,3) and (2,5,7) are not orthogonal, 
T cannot be self-adjoint. 


Prove that a normal operator on a complex inner-product space is self- 
adjoint if and only if all its eigenvalues are real. 


COMMENT: This exercise strengthens the analogy (for normal operators) 
between self-adjoint operators and real numbers. 


SOLUTION: Suppose V is a complex inner product space and T € L(V) 
is normal. 

If T is self-adjoint, then by 7.1 all its eigenvalues are real. 

Conversely, suppose that all the eigenvalues of T are real. By the com- 
plex spectral theorem (7.9), there is an orthonormal basis (e€1,...,én) of V 
consisting of eigenvectors of T. Thus there exist real numbers 1,...,An 
such that Te; = A;e; for j = 1,...,n. .The matrix of T with respect to 
the basis (€1,...,€n) is the diagonal matrix with d1,..., An on the diagonal. 
This matrix equals its conjugate transpose. Thus T = T*. In other words, 
T is self-adjoint, as desired. 


Suppose V is a complex inner-product space and T € L(V) is a normal 
operator such that T? = T8. Prove that T is self-adjoint and T? =T. 


SOLUTION: By the complex spectral theorem (7.9), there is an orthonor- 
mal basis (e;,...,€n) of V consisting of eigenvectors of T. Let j,...,An be 
the corresponding eigenvalues. Thus 
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11. 


12. 


Te; = Ajej 


for j =1,...,n. Applying T yepsatedy to both ades of the equation above, 
we get Te, = = A;%e; and T®e; = Ajŝej. Thus 4,9 = À;8, which implies that 
A; equals 0 or 1. i particular, all the eigenyälita of T are real. This implies 
(by the previous exercise) that T is self-adjoint. 

Applying T to both sides of the equation above, we get 


where the second equality holds because A; equals 0 or 1. Because T? and 
T agree on a basis, they must be equal. 


Suppose V is a complex inner-product space. Prove that every normal 
operator on V has a square root. (An operator S € L(V) is called a square 
root of T € L(V) if S? =T.) 


SOLUTION: Suppose T € L(V) is normal. By the complex spectral 
theorem (7.9), there is an orthonormal basis (e1, ..., €n) of V consisting of 


eigenvectors of T. Thus there exist complex numbers \;,...,A, such that 
Te; = je; for j = 1,...,n. Define S to be the operator on V such that 
Se; = Aj ej for j = 1,...,n; here AY ? denotes a complex square root 


of Aj (every nonzero complex number has two square roots—it does not 
matter which one is chosen). Then, as is easy to verify, S? = T. Thus S is 
a square root of T. 


Give an example of â real inner-product space V and T € L(V) and real 
numbers @, 8 with a? < 42 such that T? + aT + BI is not invertible. 


COMMENT: This exercise shows that the hypothesis that T is self-adjoint 
is needed in 7.11, even for real vector spaces. 


SOLUTION: Let T € £(R?) be the counterclockwise rotation on R2; so 
T(z,y) = (— —y,2) for (x,y) € R?. Thus T? = —J. Taking a = 0 and 8 = 1, 
we have a” < 46 and 

T? +aT+BI=T? +I 
=0. 


In particular, T? + aT + BI is not invertible. 
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13. 


14. 


Prove or give a counterexample: every self-adjoint operator on V has a cube 
root. (An operator S$ € L(V) is called a cube root of T € L(V) if $? =T.) 


SOLUTION: Suppose T € L(V) is self-adjoint. By the spectral theorem 
(7.13), there is an orthonormal basis (e1,...,€n) of V consisting of eigen- 
vectors of T. The corresponding eigenvalues must be real (by 7.1). Thus 
there exist real numbers A),...,An such that Te; = Aje; for j = 1,...,n. 
Define S to be the operator on V such that Se; = Aje; for j = 1,...,n. 
Then, as is easy to verify, S? = T. Thus S is a cube root of T, completing 
the proof that every self-adjoint operator on V has a cube root. 


Suppose T € L(V) is self-adjoint, \ € F, and € > 0. Prove that if there 
exists v € V such that ||v|| = 1 and 


|Tv — Av|| < €, 
then T has an eigenvalue à’ such that |A — \/| < €. 


SOLUTION: By the spectral theorem (7.13), there is an orthonormal 
basis (e1,...,@n) of V consisting of eigenvectors of T. Let X1,...,An be the 
corresponding eigenvalues. 

Suppose v € V is such that ||v|| = 1 and |[Tv — Av|| < e. From 6.17 we 
have 


v = (v,e1)e1 +--+ + (u,en)en; 
and so 
Tv = Ax (v,e1)e1 +--+ + An(v, Enjen. 
Thus 
e > ||Tv — dol” . 
= |[(A1 — A) (v, ex)e1 +--+ + (An — A) (v, en) en? 
= [Ay = Alio, e1)|? bees o [An Ey Afilo, en) |” 


> (min{|Ai — Al?,..., [An — APH (lo, e1) |? +--+ + Ho, en) [?) 
= min{|A, — Al?,..., [An — A|} 


Thus € > |A; — À| for some j. In other words, there is an eigenvalue whose 
distance from À is less than <€, as desired. 
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15. 


16. 


17. 


18. 


Suppose U is a finite-dimensional real vector space and T € £L(U). Prove 
that U has a basis consisting of eigenvectors of T if and only if there is an 
inner product on U that makes T into a self-adjoint operator. 


SOLUTION: First suppose that U has a basis (e1,..., én) of eigenvectors 
of T. Because (€1,...,€n) is a basis of U, every element of U can be uniquely 
written as a linear combination of (e1, -.., €n). Thus we can define an inner 
product on U by . 


(aiei + +++ + anen, b161 +--+: + bpen) = aybi +--+ + nbn. 


It is easy to verify that this is indeed an inner product on U and that 
(e1,..-,€n) is on orthonormal basis of U with respect to this inner product. 
Because each e; is an eigenvector of T, the operator T has a diagonal matrix 
with respect to the orthonormal basis (e1, ..., €n). Thus T is self-adjoint. 

Conversely, now suppose that there is an inner product on U that makes 
T into a self-adjoint operator. Then by the spectral theorem (7.13), U has 
a basis consisting of eigenvectors of T. 


Give an example of an operator T on an inner product space such that T 


has an invariant subspace whose orthogonal complement is not invariant 
under T. 


COMMENT: This exercise shows that 7.18 can fail without the hypothesis 
that T is normal. 


SOLUTION: Define T € L(F?) by T(w, z) = (z,0). Then T(w,0) = (0,0) 
for all w € F. Thus the subspace U defined by U = {(w,0) : w € F} is 
invariant under T. However, U+ = {(0,z) : € F}, which is not invariant 
under T because (0,1) € U+ but T(0, 1) = (1,0) ¢ UŁ. 

Of course there are also many other examples. 


Prove that the sum of any two positive operators on V is positive. 


SOLUTION: Suppose S and T are positive operators on V. Because S 
and T are self-adjoint, so is S + T. Furthermore, 


((S +T)v, v) = (Sv, v) + (Tv, v) 
> 0. 


Thus S +T is a positive operator, as desired. 


Prove that if T € L(V) is positive, then so is T* for every positive integer k. 
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SOLUTION: Suppose T € L(V) is positive and k is a positive integer. 
Then T* is self-adjoint (because T is self-adjoint). 

First consider the case where k is an even integer. Then we can write 
k = 2m for some positive integer m. Now 


(T¥v, v) = (T?™v, v) 
= (T™v, T™v) 
20 


for every v € V, where the second equality holds because T is self-adjoint. 
The inequality above shows that T* is positive, as desired. 

Now consider the case where k is an odd integer. Then we can write 
k = 2m + 1 for some nonnegative integer m. Now 


(Tey, v) = (T2™*1y, v) 
= (T(T™v), Tv) 
>0 


for every v € V, where the second equality holds because T is self-adjoint 
and the inequality holds because T is positive. The inequality above shows 
that T* is positive, as desired. 


Suppose that T is a positive operator on V. Prove that T is invertible if 
and only if 


(Tv,v) >0 
for every v € V \ {0}. 


SOLUTION: First suppose that T is invertible. By 7.27, there exists an 
operator S € L(V) such that T = S*S. Suppose v € V \ {0}. Then Sv #0 
because otherwise we would have Tv = S*Sv = 0, which would contradict 
the invertibility of T. Now i 


(Tv, v) = (S*Sv,v) 
= (Sv, Sv) 
>0, 


as desired. 

Now suppose that (Tv, v) > 0 for every v € V \ {0}. In particular, this 
means that Tv Æ 0 for every v € V \ {0}. Thus T is injective, and hence T 
is invertible (see 3.21), as desired. 
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21. 


22. 


23. 


Prove or disprove: the identity operator on F? has infinitely many self- 
adjoint square roots. 


SOLUTION: For each t € [—1, 1], the operator whose matrix (with respect 
to the standard basis) equals 


vce | 
VIZË t 


is self-adjoint and a square root of the identity operator, as can be verified by 
squaring the matrix above. Thus the identity operator has infinitely many 
self-adjoint square roots. 


Prove or give a counterexample: if S € L(V) and there exists an orthonormal 
basis (e1, . .., €n) of V such that ||Se,|| = 1 for each e;, then S is an isometry. 


SOLUTION: Define S € L(F?) by 
S(w, z) = (w + z,0). 


With the usual inner product on F?, the standard basis ((1, 0), (0, 1)) is an 
orthonormal basis of F?. Note that ||$(1,0)|| = ||S(0,1)|| = 1. However, S 
is not an isometry because ||S(1, —1)|| = 0. 

Of course there are also many other examples. 


Prove that if S € £(R%) is an isometry, then there exists a nonzero vector 
z E€ RÌ such that Sz = z. 


SOLUTION: Suppose S € £(RÌ) is an isometry. Then there is a basis of 
R? with respect to which S has a block diagonal matrix, where each block 
on the diagonal is a 1-by-1 matrix containing 1 or —1 or is a 2-by-2 matrix 
(see 7.38). Because RÌ has odd dimension, at least one of these blocks must 
be a 1-by-1 matrix. In other words, either 1 or —1 must be an eigenvalue 
of S. Thus there is a nonzero vector x € R such that Sz = Ax, where 
A= +1. Hence 


S°x = §(Sz) = S(Az) = AST = Xs = T. 
Define T € L(F?) by 
T(a, 22) 23) = (23, 221, 322). 


Find (explicitly) an isometry S € L(F°) such that T = SV/TT. 
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SOLUTION: With respect to the standard basis of F3, we have 


0 0 i 
M((T)=|2 00 
030 
Thus 
0 2 0 
M(T*)={|0 0 3]. 
1 0 0 


Computing the product M(I*)M(T), which equals M(T*T), we get 
40 0 
M(T'T)=|0 9 0]. 
001 


From the matrix above, we see that (I*T)(z1, z2, 23) = (421, 922, z3). Thus 
VT*T (21, 22, 23) = (221,322, 23). Hence if we define S € £(F%) by 


S(z1, 22; 23) = (23, 21; z2), 
then S is an isometry and T = SVT*T. 


Suppose T € L(V), S € L(V) is an isometry, and R € L(V) is a positive 
operator such that T = SR. Prove that R = /T*T. 


COMMENT: This exercise shows that if we write T as the product of an 
isometry and a positive operator (as in the polar decomposition), then the 
positive operator must equal /T*T. 


SOLUTION: Taking adjoints of both sides of the equation T = SR, we 
have 


T* = R*S* 
= RS", 


where the last equation holds because R is positive (and hence self-adjoint). 
Multiplying together our formulas for T* and T, we get 


T*T = RS*SR 
= R? 


87 CHAPTER 7. Operators on Inner-Product Spaces 


where the last equation holds because S is an isometry (and hence $*S = I 
by 7.36). The equation above asserts that R is a square root of T*T; because 
R is positive, this implies that R = /T*T. 


25. Suppose T € L(V). Prove that T is invertible if and only if there exists a 
unique isometry S € L(V) such that T = SVT*T. 


SOLUTION: First suppose that T is invertible. The polar decomposition 
(7.41) states that there exists an isometry S € L(V) such that 


T = SVTT. 


Because T is invertible, this implies that /T*T is invertible (see Exercise 22 


in Chapter 3). Thus the equation above implies that S = T(VT'TY `. 
Because S must be given by this formula, we see that there is a unique 
operator S € L(V) such that T = SVT*T, as desired. 

Now suppose that there exists a unique isometry S € L(V) such that 
T =SVT*T. This means that the linear map S2 in the proof of the polar 
decomposition (7.41) must be 0 because otherwise we could replace Sz with 
—S_ and get another choice for S. But range Sz equals (rangeT)+, and 
hence (range T)+ = {0}. This implies that range T = V, which implies that 
T is invertible (by 3.21), as desired. 


26. Prove that if T € L(V) is self-adjoint, then the singular values of T equal 
the absolute values of the eigenvalues of T (repeated appropriately). 


SOLUTION: Suppose T € L(V) is self-adjoint. There exists an orthonor- 
mal basis (€1,...,@,) of V consisting of eigenvectors of T. Thus 


Te; = Aje; 
for each 7, where 4j,...,An E€ R are the eigenvalues of T. Thus 
T*Te; = Te, 
= (Aj)’e; 
for each j. The equation above implies that /T*Te; = |A;le; for each j. 
Thus the singular values of T are |\,|,..-,|An|, as desired. 


27. Prove or give a counterexample: if T € L(V), then the singular values of T? 
equal the squares of the singular values of T. 


SOLUTION: Define T € L(F?) by 
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29. 


30. 


T(z, z2) = (z2,0). 


Then T*T(z21, 22) = (0, z2) and hence VT*T(z1, 22) = (0,22). Thus the 
eigenvalues of VT*T are 0,1. Hence the singular values of T are 0,1. 
However, T? = 0, so the singular values of T? are 0,0. Thus for this 
operator T, the singular values of T? do not equal the squares of the singular 
values of T. 
Of course there are also many other examples. 


Suppose T € L(V). Prove that T is invertible if and only if 0 is not a 
singular value of T. 


SOLUTION: If S € L(V) and ST = TS = I, then taking adjoints we get 
T*S* = S*T* = I. Thus if T is invertible, then so is T*. 
Now 
T is invertible <> T and T* are invertible 

<=> T"T is invertible 
<=> VT*TVT"T is invertible 
<=> VT*T is invertible 
<= 0 is not an eigenvalue of VT 
<= 0 is not a singular value of T, 


where the second and fourth equivalences follow from Exercise 22 in Chap- 
ter 2. 


Suppose T € L(V). Prove that dimrangeT equals the number of nonzero 
singular values of T. 


SOLUTION: By the singular value decomposition (7.46), there exist or- 
thonormal bases (u,...,Un) and (w1, ..., Wn) of V such that 


Tv = sı (v, uy)wy +-+- + Sn (v, Un) Wr 


for every v € V, where s),..., 5, are the singular values of T. For each j, 
we have Tu; = sj;w;. Thus each wj corresponding to a nonzero sj is in 
rangeT. The equation above also shows that the w,’s corresponding to 
nonzero s;’s span rangeT. Thus dim rangeT equals the number of nonzero 
singular values of T. 


Suppose S € L(V). Prove that S is an isometry if and only if all the singular 
values of S equal 1. 
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SOLUTION: We have 


S is an isometry => S*S =I 
=> VSS =I 
<=> all the eigenvalues of /S*S equal 1 
<= all the singular values of S equal 1, 


where the first equivalence comes from 7.36 and the third equivalence comes 
from the spectral theorem (7.9 or 7.13) applied to the self-adjoint operator 


VSS. 


Suppose T1, T2 € L(V). Prove that T; and T> have the same singular values 
if and only if there exist isometries S1, S2 E L(V) such that T} = STe S2. 


SOLUTION: First suppose that T) and Tz have the same singular values 
$1,--.,8n. By the singular-value decomposition (7.46), there exist orthonor- 
mal bases (€1,..-,€n),(fi,--+5 fn), (€13 -- 3€) (fl, ---, fL) of V such that 

Tiv = s1(v,e1) fi +--+ + 5n(v, en) fa, 
Tov = si (v, e1) ft +--+ + SnU, en) fa 


for every v € V. Define S1, S2 € L(V) by 


Si(afi +: + anfa) = afi +--+ anfa, 
So(ayey + +--+ anen) = areh +--+ anel. 


Then 


lSilafi +--+ anfall? = llarfi +--- + anfall? 
= |u|? +--+ lanl? 
= jla fi +--+ an fill?, 


and thus S; is an isometry. Similarly, Sj is an isometry. This implies that 
S2* = S27! (see 7.36). In particular, S2*e; = ej. Now for v € V we have 


T2(S2v) = $1 (Sov, enti Titi Sn (S20, en) fn 
= sı (v, S2*eh) fi +--+ + Srv, Sael) fi 
silve) fi tee 8n(U, En) fn- 
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Thus 


Si(T2S2v) = s1(v,e1) Sif, Heee + Sn(v, en) Si fh 
= sıv, eifi se a Sn(v, en) ta 
= Tyv 


for every v € V. Hence SıTzS2 = T), as desired. 
To prove the implication in the other direction, now suppose that there 
exist isometries $1, S2 E L(V) such that Ti = SıT2S2. Using 7.36, we have 


Ty*T, = So*T2* Si STS 
= S2 T DS. 


This implies that 7)*T; and T2*Tə have the same eigenvalues (and that the 
corresponding spaces of eigenvectors have the same dimensions). Thus T} 
and Tz have the same singular values. 


Suppose T € £(V) has singular-value decomposition given by 
Tv = 51(v,€1) fi + +++ + Sn (0; €n) fn 


for every v € V, where s1,..., Sn are the singular values of T and (e},...,é€n) 
and (f1,..-; fn) are orthonormal bases of V. 


(a) Prove that 
T*v = sı (v, fider +--+ Sn(v, fnden 
for every v E V. 


(b) Prove that if T is invertible, then 


T-ly = {v, fiden Fat (v, fn)en 
si Sn 
for every v E V. 
SOLUTION: (a): Fix v € V. Then 
(w, T*v) = (Tw, v) 
= (sı(w, e1) fi +-+- + Sn(w, en) fn v) 
= si(w, ei) (fi, v) ess Sn(w, €n) (fn, v) 
= (w, sı (v, filer +-+- + Sn(V, fren) 
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for all w € V. This implies that 


T*v= sı (v, fier tere Sn (V, fren, 


as desired. 
(b): Suppose T is invertible. Let v € V and let 


w= fader, 4 Wifaden, 
$1 Sn 


none of the singular values s1,...,5, equals 0 (see Exercise 28 of this chap- 
ter), so this makes sense. Now 


Tw = Ter |., y W fa) Ten 
S1 En 

= Wfderf q... q Wifadsafa 
S1 Sn 
= (v, fi) fi + + (v, fn) fn 


=v. 
Thus w = T~!v, as desired. 


Suppose T € L(V). Let § denote the smallest singular value of T, and let s 
denote the largest singular value of T. Prove that 


slull < (Tull < sllol| 
for every v E€ V. 


SOLUTION: Let v € V. By the singular value decomposition (7.46), 
there exist orthonormal bases (u),...,Un) and (wi,...,wWn) of V such that 


Tv = s (uv, uy) wy +--+ + Sn{v, Un) Wn, 


where s1,...,5n are the singular values of T. Because (uj,...,t,) and 
(wi,..., Wn) are both orthonormal bases of V, we have 


3 lloll? = 87(|(v, u1)? + --- + Io, ua) |?) 
< s1? |(v, ur)? Peret Sn” |(v, Un) |? 
Toll’, 


92 CHAPTER 7. Operators on Inner-Product Spaces 


34. 


giving the first desired inequality. Also, 


Tol? = s17|(v, u)? +-+- + 5n7|(v, un) ? 
< (v, u)? + +++ + |(v, un) |?) 
= s*|lv|l?, 


giving the second desired inequality. 


Suppose T”, T” € L(V). Let s’ denote the largest singular value of T”, let s” 
denote the largest singular value of T”, and let s denote the largest singular 
value of TY + T”. Prove that s < s$ + s". 


SOLUTION: Let T = T’ + T". Because s is a singular value of T, we 
know that s is an eigenvalue of VT*T. Thus there exists a vector v € V 
such that ||v|| = 1 and VT*Tv = sv. Now 


s = |jsvl| 

= ||VT*To|| 

= ||To|| 

= |Tv + Tvl 

< [Tol] + [TI] 

< s'|[v|| + "|u| 

Zal + gl! 
where the third line above comes from 7.42 and the sixth line above comes 
from the previous exercise. 


CHAPTER 8 


Operators on | 
Complex Vector Spaces 


Define T € £(C?) by 
T(w, z) = (z,0). 
Find all generalized eigenvectors of T. 


SOLUTION: Suppose À is an eigenvalue of T. For this particular operator, 
the eigenvalue-eigenvector equation T(w, z) = A(w,z) becomes the system 
of equations 


z = àw 
0 = Az. 


If A Æ 0, then the second equation implies that z = 0, and the first equation 
then implies that w = 0. Because an eigenvalue must have a nonzero eigen- 
vector, this shows that 0 is the only possible eigenvalue of T. For À = 0, the 
equations above show that z must equal 0, but w can be arbitrary. Thus 0 
is indeed an eigenvalue of T, and the set of eigenvectors corresponding to 
this eigenvalue is 


{(w,0): w € F}. 


Note that T? = 0. Thus every vector in C? is a generalized eigenvector 
of T (corresponding to the eigenvalue 0). 
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2. Define T € L(C?) by 
T(w, z) K (-z, w). 
Find all generalized eigenvectors of T. 


SOLUTION: On page 78 of the textbook we saw that the eigenvalues of 
T are i and —i. Note that T? = —I. 

The set of generalized eigenvectors of T corresponding to the eigenvalue i 
equals null(T — il)? (by 8.7). To compute this, note that 


(T -il =T? -2T -I 
= —21I — 2iT 
= —2i(T — il). 
Thus nul(T — iJ)? equals the set of eigenvectors of T corresponding to 
the eigenvalue 7. On page 78 of the textbook we noted that this equals 
{(a, -ia): a E€ C}. 
The set of generalized eigenvectors of T corresponding to the eigen- 
value —i equals null(T + iJ)? (by 8.7). To compute this, note that 


(T +i)? =T? +2T-1 
= —21 + 2iT 
2i(T + il). 


Thus nuli(T + iI)? equals the set of eigenvectors of T corresponding to 
the eigenvalue —i. On page 78 of the textbook we noted that this equals 
{(a,ia): aE C}. 


3. Suppose T € L(V), mis a positive integer, and v € V is such that T™-ly #0 
but T™v = 0. Prove that 


(v,Tv,T?v,... Ty) 
is linearly independent. 
SOLUTION: Suppose do, 41,4Q2,...,âm-—1 € F are such that 
aov + ayTu + oT? + +--+ am—1T™ lv = 0. 


Because T™v = 0, if we apply T™~! to both sides of the equation above, we 
get ag” 'v = 0. Because T™~'v # 0, this implies that ag = 0. Thus the 
equation above can be rewritten as 


95 CHAPTER 8. Operators on Complex Vector Spaces 


ayTut aoT?v tees am-iT™ w =0. 


Applying T™~? to both sides of this equation, we get a; T”—!v = 0. Thus 
a; = 0. Continuing in this fashion, we have ao = a1 = a2 = +--+ =Qm_1 = 0, 
which means that (v, Tv, T?v, ..., T™-}v) is linearly independent. 

4. Suppose T € L(C?) is defined by T(z, z2, z3) = (z2, 23,0). Prove that T has 


no square root. More precisely, prove that there does not exist S € £(C3) 
such that $? =T. 


SOLUTION: Note that T? = 0. Suppose there exists S € £(C3) such 
that S? = T. Then SÊ = TS = 0, so S is nilpotent. By 8.8, this implies that 
S? = 0. Thus 

Tes 
= SS’? 
= 0. 


But T? (z1, z2, 23) = (z3,0,0), so T? is not the 0 operator, contradicting the 
equation above. This contradiction shows that our supposition that there 
exists S € £(C%) such that S? = T must have been false. 


5. Suppose S,T € L(V). Prove that if ST is nilpotent, then TS is nilpotent. 


SOLUTION: Suppose ST is nilpotent. Thus there exists a positive integer 
n such that (ST)" = 0. Now 


(TS)"*! = (TS)(TS)...(TS) 
= T(ST)(ST)...(ST)S 
= T(ST)"S 
= (T)(0)(S) 
= 0, ° 


and thus TS is nilpotent. 


6. Suppose N € L(V) is nilpotent. Prove (without using 8.26) that 0 is the 
only eigenvalue of N. 


SOLUTION: There is a positive integer m such that N™ = 0. This implies 
that N is not injective, so 0 is an eigenvalue of N. 

Conversely, suppose À is an eigenvalue of N. Then there exists a nonzero 
vector v € V such that 
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dv = Nv. 
Repeatedly applying N to both sides of this equation shows that 
ATu = N™v 
= 0. 
Thus A = 0, as desired. 


Suppose V is an inner-product space. Prove that if N € L(V) is self-adjoint 
and nilpotent, then N = 0. 


SOLUTION: Suppose N € L(V) is self-adjoint and nilpotent. Because N 
is self-adjoint, there is an orthonormal basis (€1,...,¢€n) of V consisting of 
eigenvectors of N (by the spectral theorem). Because N is nilpotent, 0 is the 
only eigenvalue of N (see Exercise 6 of this chapter). Thus the eigenvalue 
corresponding to each e; must equal 0. In other words, Ne; = 0 for each j. 
Because (€1,...,€n) is a basis of V, this implies that N = 0. 


Suppose N € L(V) is such that null N4™Y-! Æ null Nä™V., Prove that N 
is nilpotent and that 

dim null NJ = j 
for every integer j with 0 < j < dim V. 


SOLUTION: Because null NË™V-1 ¥ null NÄ™V , we know (by 8.5) that 
null NJ~! Æ null NI whenever 0 < j < dim V. Thus 


{0} = null N° ¢ null N? Ç -Ç null y#m V- ç null Nim VY, 


At each of the strict inclusions in the chain above, the dimension must in- 
crease by at least 1. However, if the dimension increases by more than 1 
at any step, we would end up with dim null N@™Y > dimV, a contradic- 
tion because a subspace of V cannot have dimension larger than dim V. 
Thus the dimension increases by exactly one at each step. In other words, 
dim null NJ = j for every integer j with 0 < j < dimV. In particular, 
taking j = dim V, we have dimnull NSV = dimV. This means that 
null Nd™V = V, Thus N#™V — 0, and so N is nilpotent. 


Suppose T € L(V) and m is a nonnegative integer such that 
rangeT™ = rangeT™*?. 


Prove that range T* = range T™ for all k > m. 
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SOLUTION: Suppose u € range T™+t!, Thus there exists a vector v in V 
(the domain of T) such that u = T™*1y. Now T™v is in rangeT™, which 
by our hypothesis equals range T™+!. Thus there exists w € V such that 
T™y = T™+!w. Putting all this together, we have 


u=T™ly 

= T(T™v) 

= T(T™+ wy) 

= TTR wy, 
Thus u € rangeT™+?. Because u was an arbitrary vector in rangeT™+! we 
have shown that rangeT™+! C rangeT™**. We also have an easy inclusion 
in the other direction, so we conclude that rangeT™+! = range T™*?, 

In the paragraph above, we showed that range T™ = range T™*! implies 

rangeT™+! = rangeT™*?, Apply that result, with m replaced with m + 1, 


to conclude that rangeT™+? = range T™+3, Continuing in this fashion, we 
see that 


rangeT™ = range T™+! = rangeT™?? = ..., 
as desired. 
10. Prove or give a counterexample: if T € L(V), then 
V = null T @ rangeT. 
SoLUTION: Define T € L(F?) by T(w, z) = (z,0). Thus 
nullT = range T = {(w,0): w € F}, 


which clearly implies that F? is not the direct sum of null T and range T. 
Of course there are also many other examples. 


1l. Prove that if T € L(V), then 
V = null T” @ range T”, 
where n = dim V. 
SOLUTION: Let T € L(V). First we show that 


V = null T” + range T”. 


98 CHAPTER 8. Operators on Complex Vector Spaces 


12. 


13. 


To do this, let v € V. Then 
. v = (v — T"u) +T*u 


for any vector u € V. Obviously T”u € rangeT”. Thus looking at the 
equation above, we see that we need to show that there exists u € V such 
that v — T”u € nullT”. In other words, we want a vector u € V such 
that T”(v — T”u) = 0, which is equivalent to T”v = T?"u. But T”v € 
rangeT”, and rangeT™ = range T?” (by 8.9), so T”u € rangeT?™. Thus 
there indeed exists u € V such that T™v = T?"u, completing our proof 
v E€ null T” +range T”. Because v was an arbitrary vector in V, this implies 
that V = null T” + range T”. 

For any linear map (and in particular for T”), the dimension of the 
domain equals the sum of the dimensions of the null space and range (by 3.4). 
In other words, 


dim V = dim null T” + dim range T”. 


This equation, along with the equation V = null T” + range T”, implies that 
V = null T” @ range T” (by 2.19). 


Suppose V is a complex vector space, N € L(V), and 0 is the only eigenvalue 
of N. Prove that N is nilpotent. Give an example to show that this is not 
necessarily true on a real vector space. 


SOLUTION: Because 0 is the only eigenvalue of N, 8.23(a) implies that 
every vector in V is a generalized eigenvector of T corresponding to the 
eigenvalue 0. This implies that N is nilpotent. 

Define T € £(R3) by 


T(z, yY, z) = (-y, T, 0). 


Then 0 is an eigenvalue of T because T(0,0,1) = (0,0,0). As can be ver- 
ified from the definition of eigenvalue, T has no other eigenvalues (which 
must be in R, because T is an operator on a real vector space). However, 
T3(zx,y,z) = (y, —2,0). In particular, T3 4 0. Thus T is not nilpotent. 

Of course there are also many other examples. 


Suppose that V is a complex vector space with dim V = n and T € L(V) is 
such that 


null T7? £ null T”! 


Prove that T has at most two distinct eigenvalues. 
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SOLUTION: Because null 7”? 4 nullT™-!, we see that dim null TÍ is 
at least, 1 more than dim null TÍ-! for j = 1,...,n — 1 (by 8.5). Thus 
dim null T"~! > n—1. In particular, 0 is an eigenvalue of T with multiplicity 
at least n — 1. Because the sum of the multiplicities of all the eigenvalues 
of T equals n (by 8.18), this implies that T' can have at most one additional 
eigenvalue. 


Give an example of an operator on C4 whose characteristic polynomial 
equals (z — 7)? (z — 8)?. 


SOLUTION: Define T € £(C*) by 
T (21, 22, 23, z4) = (721, 7z2, 8z3, 8z4). 
Then null(T — 7J) is the two-dimensional subspace 
{(z1, z2,0,0) : 21,22 E C} 
and null(T — 87) is the two-dimensional subspace 
{(0, 0, z3, z4) : 23, z4 € C}. 


Thus 7 is an eigenvalue of T with multiplicity at least 2 and 8 is an eigenvalue 

of T with multiplicity at least 2. Because 2+2 = 4 = dim C‘, there can be no 

other eigenvalues of T and the eigenvalues 7 and 8 must have multiplicity 2 

(by 8.18). Thus the characteristic polynomial of T' equals (z — 7)?(z — 8)?. 
Of course there are also many other examples. 


Suppose V is a complex vector space. Suppose T € L(V) is such that 5 and 
6 are eigenvalues of T and that T has no other eigenvalues. Prove that 


(T —51)"-1(T — 67)" =0, 
where n = dim V. 


SOLUTION: Because 5 and 6 are eigenvalues of T and T has no other 
eigenvalues, the characteristic polynomial of T must be of the form 


(z —5)9(z—6)®, 


where 1 < dı and 1 < dg. Because dj +d2 = n, we must also have dj < n- 1 
and dz < n — 1. The Cayley-Hamilton theorem implies that 


(T — 51)" (T - 61)? =0. 
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Because dı < n — 1 and dz < n — 1, we can multiply the equation above by 
appropriate powers of T — 5I and T — 61 to get (T —5I)""1(T—6I)""! = 0. 


Suppose V is a complex vector space and T € L(V). Prove that V has a basis 
consisting of eigenvectors of T if and only if every generalized eigenvector 
of T is an eigenvector of T. 


COMMENT: For complex vector spaces, this exercise adds another equiv- 
alence to the list given by 5.21. 


SOLUTION: First suppose that V has a basis consisting of eigenvectors 
of T. Thus there exists a basis (v1,...,Un) of V and à4,..., An E€ C such 
that Tv; = A;v; for each j. Suppose v € V is a generalized eigenvector of T 
corresponding to an eigenvalue À. Because (v1,..., Un) is a basis of V, there 
exist @),...,@n E C such that 


V = QV +-+-+ApUn- 
Thus 
(T — AD) = (Ar — A)ayyy +--+ + An — A)anUn. 
Applying T — XI repeatedly to both sides of this equation, we get 
(T — AD)?u = (Ay — A)"aiay +--+ (An — A)" antn. 


The left side of the equation above equals 0 (because v is a generalized 
eigenvector of T corresponding to the eigenvalue A). Thus the right side 
of the equation above equals 0. Thus (A; — A)"a; = 0 for each j. This 
implies that the indices j such that a; # 0 must all have all satisfy A; = 4, 
which means that v is a linear combination of the v;’s that correspond to 
eigenvalue A, which means that v is eee an eigenvector corresponding to 
eigenvalue À, as desired. 

To prove the other direction, now suppose that every generalized eigen- 
vector of T is an eigenvector of T. By 8.25, there exists a basis of V consisting 
of generalized eigenvectors of T. Because every generalized eigenvector of T 
is an eigenvector of T, this gives a basis of V consisting of eigenvectors of T, 
as desired. 


Suppose V is an inner-product space and N € L(V) is nilpotent. Prove 
that there exists an orthonormal basis of V with respect to which N has an 
upper-triangular matrix. 
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SOLUTION: By 8.26, there is a basis of V with respect to which N 
has an upper-triangular matrix. By 6.27, this basis can be chosen to be 
orthonormal. (If V is a complex inner-product space, then we can just use 
6.28 directly, so the hypothesis that N is nilpotent is not needed.) 


Define N € £(F5) by 
N(zı, T2, T3, T4, T5) = (222, 323, —T4, 4z5, 0). 

Find a square root of I + N. 

SOLUTION: Note that 

N? (z, T2, T3, T4, z5) = (623, —324, —4z5, 0, 0) 
N3(x1, T2, T3, T4, T5) = (—6z14, —12z25, 0, 0, 0) 
Nt (z, T2, T3, T4, Zs) = (—24z5, 0, 0, 0, 0) 
N5(z1, 22, T3, T4, T5) = (0,0, 0, 0, 0). 

Because N5 = 0, the proof of 8.30 shows that 


i RNE ee rae ee = A 
r NN aoa 


is a square root of I + N. Using the formulas above, we calculate that the 
operator S € L(F*) defined by 


S(x1, T2, T3, T4, T5) = 


zi +r- —> H H, mt OL 


( 323 324 1525 323 3T4 325 
4 8 16 2 8 4°. 


T3 — = +5, T4 + 255, zs) 


is a square root of I + N. 


Prove that if V is a complex vector space, then every invertible operator 
on V has a cube root. 


SOLUTION: First suppose that N € L(V) is nilpotent. We will show 
that J + N has a cube root by imitating the proof of 8.30. Specifically, we 
guess that there is a cube root of I + N of the form 


I+a,N +a2N? +- Hama NL, 
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where m is such that N™ = 0. Having made this guess, we can try to choose 
Q1,92,...,;@m—1 SO that the operator above has its cube equal to J+ N. Now 


(I +a,N + agN? + a3N3 +---+ am 1N™ 13 
=I+3a,N + (3a2 + 3a,7)N? + (3a3 + baia + ay>)N? +- 
+ (3am—1 + terms involving a1,...,@m-2)N™!. 


We want the right side of the equation above to equal J + N. Hence choose 
a, so that 3a; = 1 (thus a; = 1/3). Next, choose az so that 3a2 + 3a)? = 0 


(thus ag = —1/9). Then choose a3 so that the coefficient of N? on the 
right side of the equation above equals 0 (thus ag = 5/81). Continue in this 
fashion for j = 4,...,m—1, at each step solving for a; so that the coefficient 


of NJ on the right side of the equation above equals 0. Actually we don’t 
care about the explicit formula for the a;’s. We need only know that some 
choice of the a;’s gives a cube root of I + N. 

Having shown that the identity plus a nilpotent always has a cube root, 
we now look at the proof that every invertible operator on a complex vector 
space has a square root (see 8.32). In that proof, if we replace the words 
“square root” with “cube root”, we get a proof that every invertible operator 
on a complex vector space has a cube root. 


Suppose T € L(V) is invertible. Prove that there exists a polynomial 
p € P(F) such that TT! = p(T). 


SOLUTION: Let ap + aiz +--+- + Gm—12™! + 2™ denote the minimal 
polynomial of T. This is the monic polynomial of smallest degree such that 


aol +a;T +---+0m—17" ! +T” =0. 


If ag were equal to 0, then we could multiply both sides of the equation 
above by T7! to get 


aul + aol +--+ am- T"? + T™! =0, 


which would give a monic polynomial q of smaller degree such that q(T) = 0, 
contradicting the definition of the minimal polynomial. Thus ag Æ 0. 

Because ag # 0, we can solve the first equation above for the identity 
operator J, getting 


_ 1 
P22 pees ee ee 
ao ag ag 
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Now multiply both sides of the equation above by T—!, getting 


Tos Oly rp eae! Om=1 pm—2 a Ipm- 
ag ao ao ao 
Setting 
| = 1 
PC ee ea ee eit a 
ao ag ag 


we thus have T7! = p(T). 
Give an example of an operator on C? whose minimal polynomial equals z?. 
SOLUTION: Define T € £(C4) by 
T(w1, w2, w3) = (w3,0, 0). 


Clearly T? = 0. In other words, the polynomial z? when applied to T gives 0. 
Thus the minimal polynomial of T is a divisor of z? (by 8.34). But the only 
monic polynomials that divide 2? are 1, z, and z?. The polynomial 1 applied 
to T gives the identity operator, which is not 0, and the polynomial z applied 
to T gives T, which is also not 0. Thus the minimal polynomial of T must 
be 2?. 

Of course there are also many other examples. 


Give an example of an operator on C4 whose minimal polynomial equals 
z(z — 1}. 


SOLUTION: Define T € L(C*) by 
T(w1, we, w3, wa) = (0, we + w4, w3, w4). 
Then 
(T — I)(wi, we, w3, w4) = (—w1, w4, 0,0), 
which implies that 
(T — I)? (w1, we, w3, w4) = (w1,0,0,0), 
which implies that 


T(T — I)? =0. 
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In other words, the polynomial z(z — 1)? when applied to T gives 0. Thus 
the minimal polynomial of T' is a divisor of z(z — 1)? (by 8.34). 

Note that 0 is an eigenvalue of T because T(1,0,0,0) = (0,0,0,0) and 1 
is an eigenvalue of T because T(0,1,0,0) = (0,1,0,0). Thus 0 and 1 must 
both be roots of the minimal polynomial of T (by 8.36). 

The only monic polynomials that divide z(z — 1)? and have 0, 1 as roots 
are z(z — 1) and z(z — 1)”. Because T(T — I) # 0, as is easy to check, this 
implies that z(z — 1)? is the minimal polynomial of T. 

Of course there are also many other examples. 


Suppose V is a complex vector space and T € L(V). Prove that V has a 
basis consisting of eigenvectors of T if and only if the minimal polynomial 
of T has no repeated roots. 


COMMENT: For complex vector spaces, this exercise adds another equiv- 
alence to the list given by 5.21. 


SOLUTION: First suppose that there is a basis (v1, . . . , Un) of V consisting 
of eigenvectors of T. Let 1,...,Am be the distinct eigenvalues of T. Then 
for each vj, there exists 4, with (T — A, J)vj = 0. Thus 


(T = MI)... (T — AmI)v; =0 


for each j (because all the operators in sight commute, for each j the appro- 
priate T — A;,J can be moved to the last position in the product above). An 
operator that sends each vector in a basis to the 0 vector is the 0 operator, 
so 


(THAD) nT =’) =0: 


Thus the polynomial (z — ,)...(z — Am) when applied to T gives 0. Thus 
the minimal polynomial of T is a divisor of (z — Ai)... (z — Am) (by 8.34). 
Because (z — 41)...(z — Am) has no repeated roots, this implies that the 
minimal polynomial of T has no repeated roots, as desired. (Because each 
eigenvalue of T must be a root of the minimal polynomial of T, the minimal 
polynomial of T actually equals (z — A;)...(z — Am).) 

To prove the implication in the other direction, now suppose that the 
minimal polynomial of T has no repeated roots. Letting \,,..., Am denote 
the distinct eigenvalues of T, this means that the minimal polynomial of T 
equals 


(z — Ai)... (z — Am). 
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Thus 
(T — àI)... (T — àm) = 0. 


Let Um be the subspace of generalized eigenvectors corresponding to the 
eigenvalue Am. Recall that Um is invariant under T (see 8.23(b)). Suppose 
v E€ Um. Let u = (T —AnJ)v, so u € Um. The equation above implies that 


(Tlu = àT) eee (Tlu.. a Am-11)u = (T = Ail) alle = Am!)v 
= 0. 


Because (T — AmJ)\u,, is nilpotent (see see 8.23(c)), 0 is the only eigenvalue 
of (T — AmZ)|u,, (this follows from 8.26 and 5.18). Thus Tlu, — A;I is 
invertible (as an operator on Um) for j = 1,...,m — 1. The equation above 
thus implies that u = 0. In other words, v is an eigenvector of T. 

We have shown that every generalized eigenvector of T corresponding to 
the eigenvalue Àm is an eigenvector of T. There is nothing special about the 
eigenvalue A,,—we could have relabeled the eigenvalues so that any of them 
was called Am- Thus we can conclude that every generalized eigenvector of 
T is actually an eigenvector of T. Because there is a basis of V consisting 
of generalized eigenvectors of T (see 8.25), this means that there is a basis 
of V consisting of eigenvectors of T, as desired. 


Suppose V is an inner-product space. Prove that if T € L(V) is normal, 
then the minimal polynomial of T has no repeated roots. 


SOLUTION: Suppose T € L(V) is normal. Let p denote the minimal 
polynomial of T. Suppose à € F is an eigenvalue of T. We can write 


p(z) = (z — A)™q(z) 


where m is a positive integer and q is a monic polynomial such that q(A) £ 0. 
Our goal is to prove that m = 1, which implies that p has no repeated roots. 

Because p(T) = 0, we know that (T — AI)™q(T) = 0. This is equivalent 
to the statement that 


range q(T) C null(T — AJ)™. 


Because T is normal, so is T — AJ, and thus null(T — AJ)™ = null(T — AZ) 
(by Exercise 7 in Chapter 7). Hence the set inclusion above becomes 


range q(T) C null(T — AJ), 
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which implies that (T—AI)q(T) = 0. Letting pi(z) = (z—A)q(z), this means 
that p, is a monic polynomial with the property that p;(T) = 0. If m > 1, 
then the degree of pı would be less than the degree of p, contradicting the 


definition of minimal polynomial. Thus we can conclude that m = 1, as 
desired. 


COMMENT: The proof given above works on both real and complex 
vector spaces. If V is a complex vector space, then this exercise can be done 
by using the complex spectral theorem (7.9) and Exercise 23 of this chapter. 


Suppose T € L(V) and v € V. Let p be the monic polynomial of smallest 
degree such that 


p(T)v = 0. 
Prove that p divides the minimal polynomial of T. 


SOLUTION: Let q denote the minimal polynomial of T. By the division 
algorithm (4.5), there exist polynomials s,r € P(F) such that 


q=sp+r 
and degr < degp. Thus 
Q(T )v = s(T)p(T)v + r(T)v. 


Because q(T) = 0 and p(T)v = 0, the equation above shows that r(T)uv = 
0. This implies that r = 0 (otherwise we could multiply r by a scalar to 
get a monic polynomial with degree smaller than deg p that when applied 
to T gives an operator having v in its null space, which would contract 
the definition of p as the monic polynomial of smallest degree with this 
property). 

Using the information that r = 0, rewrite the formula above for q as 


q = sp. 
Thus p divides q, the minimal polynomial of T. 


Give an example of an operator on C4 whose characteristic and minimal 
polynomials both equal z(z — 1)}?(z — 3). 


SOLUTION: Define T € £(C*) by 


T (w, we, w3, w4) = (0, we + wz, wz, 3w4). 
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An easy computation shows that T(T — I)?(T — 3I) = 0. Thus the 
minimal polynomial of T is a divisor of z(z — 1)?(z — 3) (by 8.34). 

Note that 0 is an eigenvalue of T because T(1,0,0,0) = (0,0,0,0) and 1 
is an eigenvalue of T because T(0, 1,0,0) = (0, 1,0,0) and 3 is an eigenvalue 
of T because T(0,0,0,1) = (0,0,0,3). Thus 0, 1, and 3 must be roots of the 
minimal polynomial of T (by 8.36). 

The only monic polynomials that divide z(z—1)?(z—3) and have 0, 1,3 as 
roots are z(z—1)(z—8) and z(z—1)?(z—3). Because T(T—1)(T—3) Æ 0, as 
is easy to check, this implies that z(z—1)?(z—3) is the minimal polynomial 
of T, as desired. 

Because T is an operator on a four-dimensional complex vector space and 
the minimal polynomial of T has degree 4, the characteristic polynomial of 
T (which is a monic polynomial of degree 4 that is divisible by the minimal 
polynomial of T) must equal the minimal polynomial of T. 

Of course there are also many other examples. 


Give an example of an operator on C* whose characteristic polynomial 
equals z(z— 1)?(z — 3) and whose minimal polynomial equals z(z—1)(z—3). 


SOLUTION: Define T € £(C*) by 
T(wi, W2, W3, wa) = (0, W2, W3, 3w4). 


An easy computation shows that T(T—I)(T—3I) = 0. Thus the minimal 
polynomial of T is a divisor of z(z — 1)(z — 3) (by 8.34). 

Note that 0 is an eigenvalue of T because T(1,0,0,0) = (0,0,0,0) and 1 
is an eigenvalue of T because T(0, 1,0,0) = (0, 1,0,0) and 3 is an eigenvalue 
of T because T(0, 0,0, 1) = (0,0,0, 3). Thus 0, 1, and 3 must be roots of the 
minimal polynomial of T (by 8.36). 

The only monic polynomial that is a divisor of z(z — 1)(z — 3) and has 
0,1,3 as roots is z(z — 1)(z — 3). Thus z(z — 1)(z — 3) is the minimal 
polnoma of T, as desired. 

Note that every vector in {(0, we, w3,0) : w2, w3 € C) is an eigenvector 
of T corresponding to the eigenvalue 1. Thus the eigenvalue 1 of T has 
multiplicity at least 2. The eigenvalues 0 and 3 of T have multiplicity at 
least 1. 

Because T' is an operator on a four-dimensional complex vector space, the 
sum of the multiplicities of all the eigenvalues equals 4 (by 8.18). Thus the 
each use of the phrase “at least” in the previous paragraph can be replaced 
by “equal” because if any of the eigenvalues had larger multiplicity, the sum 
of the multiplicities of all the eigenvalues would exceed 4. 
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Because 0 and 3 are eigenvalues of T with multiplicity 1 and 1 is an 
eigenvalue of T with multiplicity 2 and T has no other eigenvalues (the mul- 
tiplicities of the eigenvalues mentioned already sum to 4), the characteristic 
polynomial of T equals z(z — 1)?(z — 3), as desired. 

Of course there are also many other examples. 


28. Suppose ag,...,@n—1 € C. Find the minimal and characteristic polynomials 
of the operator on C” whose matrix (with respect to the standard basis) is 


0 —ag 
1 0 —ai 
1 ar —a2 

0 —an-2 

l —an-ı 


COMMENT: This exercise shows that every monic polynomial is the 
characteristic polynomial of some operator. 


SOLUTION: Suppose that T € £(C") has matrix as above with respect 
to the standard basis (e1,...,éen) of C”. Thus 


Te, = €? 


Te = Tez = €3 


T” lei = Ten-1 = en 
Te; = Ten = —ag€; — aje — +++ — An—1ên- 


Thus 
(e1, Tei, Tey, tee T" e1) = (e1, €2, €3,--+ En). 


In particular, (e1, Te1, T?e1,..., T” }e1) is linearly independent. Thus if p 
is a monic polynomial with degree less than n, then p(T)e, 4 0. Thus the 
minimal polynomial of T must have degree n. 

Writing T"e, as a linear combination of (e;,Te,,T%e1,...,T"~!e1) is 
possible in only one way: 


Te, = —age, — aTe; — +++ — an-1T" tej. 


Thus setting 
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p(z) = ag + aiz +- + Gn-12"-) +2", 


we see that p is the only monic polynomial of degree n such that p(T)e; = 0. 
Hence p must equal the minimal polynomial of T. 

Because T is an operator on an n-dimensional complex vector space and 
the minimal polynomial of T has degree n, the characteristic polynomial 
of T (which is a monic polynomial of degree n that is divisible by the min- 
imal polynomial of T) must equal the minimal polynomial of T. Thus the 
characteristic polynomial of T also equals p. 


Suppose N € L(V) is nilpotent. Prove that the minimal polynomial of N 
is 2™+!, where m is the length of the longest consecutive string of 1’s that 
appears on the line directly above the diagonal in the matrix of N with 
respect to any Jordan basis for N. 


SOLUTION: Suppose (v1,...,Un) is a Jordan basis for N and that m 
equals the length of the longest consecutive string of 1's that appears on 
the line directly above the diagonal of M(N,(u1,...,Un)). The diagonal of 
ae (vi... vo) contains only 0’s (by Exercise 6 of this chapter). Thus 


M(N, (v1,..-;Un)) is a block diagonal matrix whose blocks have the form 
0 1 0 
l ? 
0 0 


and the largest such block in an (m + 1)-by-(m + 1) matrix. For each vj, 
we see that N™+!y; = 0. Because N™+! equals 0 on a basis of V, we 
conclude that N™+! = 0. Thus the minimal polynomial of N must divide 
z™*1 (by 8.34) and hence must be of the form z* for some k < m +1. But 
there is a basis vector v; such that N™v; = vj-m #0. Thus N™ Æ 0, which 
implies that the minimal polynomial of N equals z™t!, 


Suppose V is a complex vector space and T € L(V). Prove that there 
does not exist a direct sum decomposition of V into two proper subspaces 
invariant under T if and only if the minimal polynomial of T is of the form 
(z — »)4'™VY for some A € C. 


SOLUTION: First suppose that there does not exist a direct sum decom- 
position of V into two proper subspaces invariant under T. Thus T has only 
one eigenvalue (by 8.23), which we will call À. 
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There is a Jordan basis of T (by 8.47), meaning that with respect to this 
basis T. has a block diagonal matrix 


Aj 0 
0 Am 


where each A; is an upper-triangular matrix of the form 


x A 0 
Aj = 

a 1 

0 x 


If m > 1, then we could let U denote the span of the basis vectors corre- 
sponding to A, and let W denote the span of the basis vectors corresponding 
to U2,...,Um; we would have V = U@W, where U and W would be proper 
subspaces of V invariant under T. This contradiction shows that m = 1. 
Thus the matrix of T with respect to our Jordan basis is just A;. In other 
words, there is a basis (v1,...,Udimv) of V such that the matrix of T — AI 
with respect to this basis is 


0 1 0 
1 
0 0 


The previous problem now implies that the minimal polynomial of T — AI 
equals z4'™V_ This clearly implies that the minimal polynomial of T equals 
(z — A)#™1V as desired. : 

To prove the implication in the other direction, suppose that the minimal 
polynomial of T equals (z — A)“". Suppose that there exist two proper 
subspaces U1, U2 of V, each invariant under T, such that V = U; @ Up. 

Let pı denote the minimal polynomial of T|y, and po denote the minimal 
polynomial of T|y,. If u; € U,, then 


(pip2)(T)ui = po(T) p(T) u1 
= 0. 


Similarly, if u2 € U2, then 
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(pıp2)(T)u2 = pı(T)p2(T)u2 
= 0. 


Because every vector in V can be written in the form u; +u2, where uj € Ui 
and ug € Uz, the equations above imply that (pıpa)(T) = 0. Thus the 
minimal polynomial of T, which equals (z — \)*'™Y, is a divisor of Pipe 
(by 8.34). , 

The degree of the monic polynomial pıpz equals the degree of pı plus the 
degree of p2, which is less than or equal to dim U} + dim U2, which equals 
dim V. Because (z — A)*™V is a divisor of pipz, this implies that 


pi(z)p2(z) = (z — ayn. 
This implies that 
pi(z) = (z~ A) and po(z) = (z — A)im 42, 


Let m = max{dim U}, dim U2}. Let p(z) = (z — A)™. The equations above 
show that p(T)|v, = 0 and p(T)|y, = 0. Thus p(T) = 0. Because the degree 
of p is less than dim V, this contradicts our hypothesis that (z—)*™V is the 
minimal polynomial of T. This contradiction means that our assumption 
that there exists a direct sum decomposition of V into two proper subspaces 
invariant under T must have been false, as desired. 


Suppose T € L(V) and (v;,...,0n) is a basis of V that is a Jordan ba- 
sis for T. Describe the matrix of T with respect to the basis (Un,.--;U1) 
obtained by reversing the order of the v’s. 


SOLUTION: The matrix of T with respect to (u,...,Un) is a block di- 
agonal matrix 


Ay 0 


where each A; is an upper-triangular matrix of the form 
Aj 1 0 


Aj = 
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Temporarily fix j, and let (u1,..., up) be the part of (v1, . . . , Un) correspond- 
ing to the block 4;. Thus 


Tuy = Ajur 
Tuz = u, + Àju2 
Tug = u + Aju3 


Tug = Up—1 + Aju. 
Write the equations above in the form 


Tup = ÀjUk + Uk-1 
Tuk- = AjUp—1 + Uk-2 


Tuz = Aju2 + uy 


Tuy = Aju. 
Thus the matrix of T|span(uz,...,u,) With respect to (uk, -.-., u1) is 
Àj 0 
Ba? 
0 1 Aj 


In other words, B; is obtained from A; by reflection across the diagonal, so 
in B; the 1’s lie below the diagonal instead of above it. Now we see that the 
matrix of T with respect to (vn,---,01) is the block diagonal matrix 


Bm 0 


0 Bı 


CHAPTER 9 


Operators on . 
Real Vector Spaces 


Prove that 1 is an eigenvalue of every square matrix with the property that 
the sum of the entries in each row equals 1. 


SOLUTION: Suppose that A is an n-by-n matrix such that the sum of 
the entries in each row of A equals 1. Let x be the n-by-1 matrix all of 
whose entries equal 1. Then from the definition of matrix multiplication we 
see that the entry in row j, column 1 of Ar equals that sum of the entries 
in row j of A, which equals 1. Thus As = x, which implies that 1 is an 
eigenvalue of A. 


Consider a 2-by-2 matrix of real numbers 
ac 
Ass | ae | 
Prove that A has an eigenvalue (in R) if and only if 
(a — d}? + 4bc > 0. 


SOLUTION: A number à € R is an eigenvalue of A if and only if there 
exist numbers z,y € R, not both 0, such that 


Le allle] 


The left side of this equation equals [ bea. 


dy ], so the equation above is 
equivalent to the system of equations 


113 
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(a — A)£ + cy =0 
bz + (d—-A)y = 0. 


It is easy to see that this system of equations has a solution other than 
x = y = 0 if and only if 


(a — A)(d — A) = be, 
which is equivalent to the equation 
X? — (a + d)A + (ad — bc) = 0. 
There is a real number A satisfying the equation above if and only if 
(a +d}? — 4(ad — bc) > 0. 


The left side of the inequality above equals (a — d)? + 4bc, and thus we 
conclude that A has a real eigenvalue if and only if 


(a — d)? + 4bc > 0. 
3. Suppose A is a block diagonal matrix 


Ay 0 
A= we A 
0 Am 


where each A; is a square matrix. Prove that the set of eigenvalues of A 
equals the union of the eigenvalues of A,,..., Am. 


SOLUTION: Suppose that A; has size n;-by-n;. Let 


Tı 


Tm 
where each x; is an nj-by-1 matrix. Then 
Azı 
Ar = : 

Anim 


Thus the equation Ax = Az is equivalent to the system of equations 
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Áızı = AD) 


Anita Afm- 


Suppose that À is an eigenvalue of A. Then there is a nonzero vector x 
satisfying the system of equations above. Because z is nonzero, there exists 
k such that rz is nonzero. Because Azzy, = Arg, this implies that À is an 
eigenvalue of Ay. Thus À is in the union of the eigenvalues of Aj,...,Am, 
as desired. 

To prove the implication in the other direction, suppose now that A is in 
the union of the eigenvalues of Aj,..., Am- Then there exists k such that A 
is an eigenvalue of Ag. Thus there exists a nonzero nx-by-1 vector x, such 
that A,r, = Azp. For j # k, define zj to be the n;-by-1 matrix all of whose 
entries equal 0, and define x to be the matrix determined by 21,...,2m as 
above. Then z is nonzero (because x is nonzero) and Ar = Az, which 
shows that À is an eigenvalue of A, as desired. 


4. Suppose A is a block upper-triangular matrix 
Ay * 


0 Åm 
where each A; is a square matrix. Prove that the set of eigenvalues of A 
equals the union of the eigenvalues of A;,...,Am- 


COMMENT: Clearly Exercise 4 is a stronger statement than Exercise 3. 
Even so, students may want to do Exercise 3 first because it is easier than 
-Exercise 4. 


SOLUTION: We will prove that 0 is an eigenvalue of A if and only if 0 is 
an eigenvalue of at least one of the A;’s. This is all we need to do, because 
for arbitrary À € F, we can replace A with A— XI and each Ax with Ak—AI, 
concluding that A is an eigenvalue of A if and only if À is an eigenvalue of at 
least one of the A;’s. This last statement implies that the set of eigenvalues 
of A equals the union of the eigenvalues of Aj,...,Am- 

Suppose that A has size n-by-n and each A; has size n;-by-n;. We can 
write a typical n-by-1 matrix z in the form 


Tı 
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where each z; is an n;-by-1 matrix. The product Ar can be computed by 
multiplying together the block matrices of A and z given above, using the 
same formula as one would use when multiplying matrices of numbers (this 
follows from the definition of matrix multiplication). 

First suppose that 0 is an eigenvalue of A. Thus there exists a nonzero 
n-by-1 matrix x such that Ax = 0. Write z in the form above, and let k be 
the largest index with nonzero s; thus we can write 


Ti 


(If k = m, then the 0’s shown above at the tail of z do not appear.) If we 
break Az into blocks of the same size as was done for x, then the kt? block 
of Az will equal Az2;; this follows from the block upper-triangular form of 
A and the 0’s that appear in z after the kt? block. But Ax = 0, so the kt? 
block of Ax equals 0, so Azp = 0. Because zy, # 0, this implies that 0 is 
an eigenvalue of Ag, as desired. 

To prove the implication in the other direction, suppose now that 0 is an 
eigenvalue of some Ap. This means that the operator on Mat(n,1,F) (the 
vector space of n-by-1 matrices) that sends x; E€ Mat(n,1,F) to Aga is not 
injective. Thus this operator is not surjective (by 3.21). Thus the operator 
on Mat(n; + ---+ nk, 1, F) that sends 


Ti 
Tk 
to 
Aj * Tı 
0 Ax Tk 


is not surjective (because the last block in the product above will be AkTk, 
which cannot be an arbitrary n,-by-1 matrix). Again using 3.21, this means 
that the last operator is not injective. In other words, there exists a nonzero 
vector 
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Tı 
€ Mat(ni +--+ nk, 1, F) 
Tk 
such that 
Ay * Tı 
ba : | =0. 
0 Ax Tk 


Adjoining an appropriate number of 0’s, this implies that 


Aj * Tı 
Ax Tk | o 
Ag+ 0 ` 

0 Am 0 


In other words, 0 is an eigenvalue of A, as desired. 


5. Suppose V is a real vector space and T € L(V). Suppose a,f € R are 
such that T? + aT + GI =0. Prove that T has an eigenvalue if and only if 
a? > 48. 


SOLUTION: First suppose that T has an eigenvalue \ € R. Thus there 
exists a nonzero vector v € V such that Tv = Av. Applying T to both sides 
of the last equation, we get T?v = \?v. Thus 


= (T? +aT + BI)v 

= Av + adv + pv 

= (A? +ad + B)v. 
Because v # 0, the last equation implies that 

àX? +ar(.+ B=0, 


which implies (recall that À, a, and £ are all real) that a? > 42, as desired. 

To prove the implication in ons opposite direction, suppose now that 
a? > 48, Then the polynomial z? + ar + B has two real roots, which means 
that we can write 
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T? +ar +p = (z -—ri)(z — ro) 
for some T1,T2 E R. Thus 


0=T?+aT +l 
= (T -rı I)(T — rel). 


In particular, (T — rı )(T — rI) is not injective, which implies that at least 
one of T — rI and T — ral is not injective. In other words, at least one of 
rı, T2 must be an eigenvalue of T. Thus T has an eigenvalue, as desired. 


6. Suppose V is a real inner-product space and T € £(V). Prove that there 
is an orthonormal basis of V with respect to which T has a block upper- 
triangular matrix 


A; * 
0 Am 
where each A; is a 1-by-1 matrix or a 2-by-2 matrix with no eigenvalues. 


SOLUTION: We know that there is a basis (v1,..., un) of V with respect 
to which the matrix of T has the block upper-triangular form above, where 
each A; is a 1-by-1 matrix or a 2-by-2 matrix with no eigenvalues (see 9.4). 

Apply the Gram-Schmidt procedure to (v1,..., Un), getting an orthonor- 
mal basis (e1,...,@n) of V such that 


span(v;,...,v;) = span(e1,...,¢€;) 


for j = 1,...,n (see 6.20). This condition on the spans implies that the 
matrix of T with respect to (e1,...,€n) is also a block upper-triangular of 
the form above, where each A, is a 1-by-] matrix or a 2-by-2 matrix (these 
Aj’s may differ from the A;’s corresponding to the matrix of T with respect 


to (v1, ---,Un)). All that remains is to show that, if necessary, we can modify 
our orthonormal basis so that none of the 2-by-2 blocks on the diagonal have 
eigenvalues. 


Suppose that A; is a 2-by-2 block on the diagonal of the matrix of T with 
respect to (e1,..., €n) and that A; has an eigenvalue À. Thus there exist 
z,y E R, not both 0, such that 
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Let €;,€x41 denote the basis vectors corresponding to A;. Then, as is easy 
to verify, 


T (rex + yersi) = ut A(zex + yery) 


for some u € span(e},...,€%-1). Let 
_ ee + Yek+ 
fk = OR ae | 
llzer + yer+i| 


and choose f,41 € V such that (fk, fk+1) is orthonormal and 


span( fx, fk+1) = span(ex, €k+1)- 


Then the matrix of T with respect to the orthonormal basis 


(e1, eee s€k-1 Shes feats Ck425 o. gen) 


will be a block upper-triangular matrix with the same entries on the diagonal 
as previously, except that A; will be replaced by the 2-by-2 matrix 


À * 

O «*«|° 
In other words, where we had A; on the diagonal, we can now think of 
having two 1-by-1 matrices on the diagonal (and we still have a block upper- 
triangular matrix because of the 0 in the lower-left entry of the matrix 


above). 
Repeating, when necessary, the procedure described above, we obtain an 


-orthonormal basis of V with respect to which T has a block upper-triangular 


matrix 
Ai * 
0 Am 
where each A; is a 1-by-1 matrix or a 2-by-2 matrix with no eigenvalues. 


Prove that if T € L(V) and j is a positive integer such that j < dim V, then 
T has an invariant subspace whose dimension equals j — 1 or j. 


SOLUTION: Suppose T € L(V) and j is a positive integer such that 
j < dim V. If V is a complex vector space, then T has an invariant subspace 
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whose dimension equals j (by Exercise 17 in Chapter 5). So we can assume 
that V is a real vector space. 

By 9.4, there is a basis (v1,...,0,) of V with respect to which T has a 
block upper-triangular matrix 


A) * 
0 Am 
where each A, is a 1-by-1 matrix or a 2-by-2 matrix. Either vj or vj must 


be the last vector in a block of vectors corresponding to some Ap. Thus 
either span(vj,...,vj—1) or span(v;,...,v;) must be invariant under T. 


Prove that there does not exist an operator T € C(R*) such that T? +T +I 
is nilpotent. 


SOLUTION: Suppose T € £(R”). From part (b) of 9.9 (combined with 
9.4, which insures that T has a matrix of the form 9.10 with respect to some 
basis of R7), we see that 


dim null(T? +T + 1)? 


must be an even integer. In particular, dim null(T? +T + I)" 4 7, which 
implies that null(T? +T + 1)’ # R7, which implies that (T? +T +)" £0, 
which implies that T is not nilpotent (by 8.8). 


Give an example of an operator T € £(C’) such that T?+T'+1 is nilpotent. 
SOLUTION: Let A = (—1+ 3%)/2. Define T € £(C’) by T = XI. Then 


T?4+T4+I=(2+A41)1 
=0. 


In particular, T? + T + J is nilpotent. 


Suppose V is a real vector space and T € L(V). Suppose a, 8 € R are such 
that a? < 48. Prove that 


null(T? + aT + AI): 
has even dimension for every positive integer k. 


SOLUTION: Let k be a positive integer, and let U = null(T?+aT+1I)*. 
We need to prove that dim U is even. 
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Because U is invariant under T (by 8.22, with p(x) = (xz? +az+)*), we 
can define S € L(U) by S = Tly. Clearly (S? + aS + BI) is the 0 operator 
on U. Thus S? + aS + BI is a nilpotent operator on U, which implies that 
null(S? +aS + BI)%™Y =U (by 8.8). Now part (b) of 9.9, applied to S and 
U instead of T and V, shows that dim U is an even integer, as desired. 


Suppose V is a real vector space and T € L(V). Suppose a, € R are such 
that a? < 48 and T? + aT + BI is nilpotent. Prove that dim V is even and 


(T? +aT +V = 0, 


SOLUTION: Let S = T?+aT+I. Because S is nilpotent, there is a 
smallest positive integer m such that S™ = 0. Thus 


{0} = null S? C nullSc nullS?c ...¢ null S™ = V, 
F + FOC E 


where the proper inclusions come from 8.5. The previous exercise states 
that each null S* has even dimension (in particular V, which equals null S™, 
has even dimension). Hence the dimension must increase by at least 2 in all 
the proper inclusions above. Thus dim S™ > 2m, which clearly implies that 
m < (dim V)/2. Because S™ = 0, this implies that S4™V/? — 0, as desired. 


Prove that if T € £(R*) and 5,7 are eigenvalues of T, then T has no 
eigenpairs. 


SOLUTION: Suppose that T € £(R3) and 5,7 are eigenvalues of T. Of 
course each of these two eigenvalues must have multiplicity at least 1. By 
9.17, the sum of the multiplicities of all the eigenvalues of T plus twice the 
sum of the multiplicities of all the eigenpairs of T equals 3. Because the sum 


of the multiplicities of all the eigenvalues of T is at least 2, there is no room 


for an eigenpair, which would add at least 2 more to the sum (because we 
take twice the sum of the multiplicities of all the eigenpairs). Thus T has 
no eigenpairs. 7 


Suppose V is a real vector space with dim V = n and T € L(V) is such that 
null T”? Æ null T”, 


Prove that T has at most two distinct eigenvalues and that T has no eigen- 
pairs. 


SOLUTION: Because nullT™~? Æ null T”-!, we see that dim null TJ is 
at least 1 more than dim null TI-! for j = 1,...,n — 1 (by 8.5). Thus 
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dim null T”! > n — 1. In particular, 0 is an eigenvalue of T with mul- 
tiplicity at least n — 1. Because the sum of the multiplicities of all the 
eigenvalues of T plus the sum of twice the multiplicities of all the eigenpairs 
of T equals n (by 9.17), this implies that T can have no eigenpairs and at 
most one additional eigenvalue. 


Suppose V is a vector space with dimension 2 and T € L(V). Prove that if 


[è a] 


is the matrix of T with respect to some basis of V, then the characteristic 
polynomial of T equals (z — a)(z — d) — bc. 


COMMENT: As usual unless otherwise specified, here V may be a real 
or complex vector space. 


SOLUTION: Let q(z) = (z — a)(z — d) — bc. If (v1, v2) is the basis of V 
with respect to which T has the matrix above, then 


Tv, = avı +buv2 and Tu= cv; + dw. 


From these equations you can easily verify that q(T)v,; = 0 and q(T)v = 0. 
Because q(T) is 0 on a basis of V, we conclude that q(T) = 0. 

Because q is a monic polynomial of degree 2 and q(T) = 0, we conclude 
that the minimal polynomial of T has degree 1 or 2. 

Suppose first that the minimal polynomial of T has degree 2. Because 
the minimal polynomial of T is a divisor of g (by 8.34), and because a monic 
polynomial of degree 2 can be a divisor of another monic polynomial of 


' degree 2 only if the two polynomials are equal, we conclude that q is the 


minimal polynomial of T. The Cayley-Hamilton theorem now implies that 
q is a divisor of the characteristic polynomial of T, which is also a monic 
polynomial of degree 2. This implies that, q is the characteristic polynomial 
of T, as desired. 

Now consider the only remaining possible case, which is that the minimal 
polynomial of T has degree 1, meaning that it equals z — À for some A € F. 
This implies that T = AJ, which implies that the characteristic polynomial 
of T equals (z—A)*. Because T = J, we must have a = d = A and b =c=0. 
Thus g(z) = (z — )*. In particular, q is the characteristic polynomial of T, 
as desired. 


COMMENT: Note that we did not need to find the eigenvalues of T to 
do this exercise. 
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Suppose V is a real inner-product space and S € L(V) is an isometry. Prove 
that if (a, 8) is an eigenpair of S, then @ = 1. 


SOLUTION: There is a basis of V with respect to which S has a block 
diagonal matrix, where each block on the diagonal is a 1-by-1 matrix or a 
2-by-2 matrix of the form 


cos@ -sin 
sinô cosé |’ 


with @ € (0,7) (see 7.38). The characteristic polynomial of the matrix above 
is (x — cos 0)? + sin? 6, which equals x? — 2(cos6)zx + 1. 
If (a, 8) is an eigenpair of S, then 


dim null(S? + aS + BI)%™Y > 0, 


which implies that x? + az + £ is the characteristic polynomial of a 2-by-2 
matrix of the form displayed above (see 9.9). Thus £ = 1. 


CHAPTER 10 


Trace and Determinant 


Suppose that T € L(V) and (u1,...,0n) is a basis of V. Prove that 
M(T, (v1,.-- ,Un)) is invertible if and only if T is invertible. 


SOLUTION: First suppose that M(T) is an invertible matrix (because the 
only basis is sight is (v1 ,..., un), we can leave the basis out of the notation). 
Thus there exists an n-by-n matrix B such that 


M(T)B = BM(T) =I. 


There exists an operator S € L(V) such that M(S) = B (see 3.19). Thus 
the equation above becomes 


M(T)M(S) = M(S)M(T) = J, 
which we can rewrite as 
M(TS) =M(ST) = M(I), 
which implies that 
TS =ST =I. 
Thus T is invertible, as desired, with inverse S. 
To prove the implication in the other direction, suppose now that T is 
invertible. Thus there exists S € L(V) such that 
TS =ST =I. 


This implies that 
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M(TS) = M(ST) = M(I), 
which implies that 
M(T)M(S) = M(S)M(T) = I. 
Thus M(T) is invertible, as desired, with inverse M(S). 


Prove that if A and B are square matrices of the same size and AB = I ; 
then BA = I. 


SOLUTION: Suppose that A and B are n-by-n matrices and AB = I. 
There exist S,T € £(F") such that 


M(S)=A and M(T) = B; 


here we are using the standard basis of F” (the existence of S,T € L(F") 
satisfying the equations above follows from 3.19). Because AB = I, we have 
M(S)M(T) = I, which implies that M(ST) = M(I), which implies that 
ST = I, which implies that TS = I (by Exercise 23 in Chapter 3). Thus 


BA=M(T)M(S) 
= M(TS) 
= M(1) 
ST. 


Suppose T € £(V) has the same matrix with respect to every basis of V. 
Prove that T is a scalar multiple of the identity operator. 


SOLUTION: We begin by proving that (v, Tv) is linearly dependent for 
every v € V. To do this, fix v € V, and suppose that (v, Tv) is linearly 
independent. Then (v,Tv) can be extended to a basis (v, Tu, uy,...,Un) 
of V. The first column of the matrix of T’ with respect to this basis is 


0 

1 

0 

0 
Clearly (2v,Tv,u1,...,uUn) is also a basis of V. The first column of the 
matrix of T with respect to this basis is 
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. © 


0 
Thus T has different matrices with respect to- the two bases we have consid- 
ered. This contradiction shows that (v, Tv) is linearly dependent for every 
v E€ V. This implies that for every vector in V is an eigenvector of T. This 


implies that T is a scalar multiple of the identity operator (by Exercise 12 
in Chapter 5). 


Suppose that (u1, ..., Un) and (v1, ..., Un) are bases of V. Let T € L(V) be 
the operator such that Tug = uz for k = 1,...,n. Prove that 


M(T, (v1,.--,0n)) = M(J, (u1, - -<3 Un), (V1: -++ Un): 


SOLUTION: Fix k. Write 


uk = aivi + aye +4nUn; 


where a),...,@n € F. Because Tu, = ug, the kt column of the matrix 
M(T, (uv, --. ,Un)) consists of the numbers a1,...,@n. Because Iu, = ug, 
the kt? column of M (I, (u1,---;Un); (v1,---,Un)) also consists of the num- 
bers a1,...,Qn. 


Because M (T, (v1,..., Un)) and M(I,(u1,.--,%n),(v1,---, Un)) have the 
same columns, these two matrices must be equal. 


` Prove that if B is a square matrix with complex entries, then there exists 


an invertible square matrix A with complex entries such that A~!BA is an 
upper-triangular matrix. 


SOLUTION: Suppose B is an n-by-n matrix with complex entries. Let 
(e1,..-,€n) denote the standard basis of C”. There exists T € £(C") such 
that M(T, (e1,.--,en)) = B (see 3.19). 

There is a basis (v,,..., Un) of V such that M(T, (u1,..., Un)) is an upper- 
triangular matrix (see 5.13). Let A = M((u1,..., Vn), (e1,--. ,€n))- Then 
A is invertible (by 10.2) and 


A" BA=A™"'M(T,(e1,.--,€n))A 
= M(T, (vi, ,Un)), 


127 


CHAPTER 10. Trace and Determinant 


where the second equality comes from 10.3. Thus ATIBA is an upper- 
triangular matrix. 


Give an example of a real vector space V and T € L(V) such that 
trace(T”) < 0. 
SOLUTION: Define T € £(R?) by 
T(z,y) = (-y, 2). 
Then T? = —I, so trace(T?) = —2. 


Suppose V is a real vector space, T € L(V), and V has a basis consisting of 
eigenvectors of T. Prove that trace(T?) > 0. 


SOLUTION: Let (u1,...,Un) be a basis of V consisting of eigenvectors 
of T. Thus there exist 41,...,An € R such that Tv; = A;v; for each j. 
Clearly the matrix of T? with respect to the basis (v1,..., Un) is the diagonal 
matrix 


M? 0 


Thus trace T? = A? +---+ An? > 0. 


Suppose V is an inner-product space and v,w € L(V). Define T € L(V) by 
Tu = (u,v)w. Find a formula for traceT. 


SOLUTION: First suppose that v Æ 0. Extend (,%,) to an orthonormal 
Tell 


basis (Tp er .++,€n) of V. Note that for each j, we have Te; = 0 (because 


(ej v) = 0). The trace of T equals the sum of the diagonal entries in the 
matrix of T with respect to the basis (TP ED ---,€n). Thus 


v 


loll 


= (aT v)w, Pe 


trace T = ro) ) + (Tey,e1) +-+- + (Ten, en) 


= (w,v). 


If v = 0, then T = 0 and so traceT = 0 = (w,v). Thus we have the 
formula 
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traceT = (w,v) 
regardless of whether or not v = 0. 


Prove that if P € L(V) satisfies P? = P, then trace P is a nonnegative 
integer. 


SOLUTION: Suppose that T € L(V) satisfies P? = P. Let (u1,..., Um) 
be a basis of range P and let (v1,..., Un) be a basis of null P. Then 


(Uis... Um, V1, -- -3 Un) 


is a basis of V (this holds because V = rangeT @ null T; see Exercise 21 
in Chapter 5). For each u; we have Pu; = uj and for each vz we have 
Pu; = 0. Thus the matrix of P with respect to the basis above of V is 
a diagonal matrix whose diagonal contains m 1’s followed by n 0’s. Thus 
trace P = m, which is a nonnegative integer, as desired. In fact, we have 
shown that 


trace P = dim range P. 
Prove that if V is an inner-product space and T € L(V), then 
trace T* = traceT. 


SOLUTION: Suppose that V is an inner-product space and T € L(V). 
Let (€1,...,€n) be an orthonormal basis of V. The trace of any operator on 
V equals the sum of the diagonal entries on the matrix of the operator with 


respect to this basis. Thus 


trace T* = (T*e,,e1) +--+ + (T*en, €n) 
= (e1,Te1) +-+- + (en, Ten) 
= (Tei, 61) +--+ (Ten, en) 
= (Te1,e1) +++ + (Tens en) 
= traceT. 


Suppose V is an inner-product space. Prove that if T € L(V) is a positive 
operator and trace T = 0, then T = 0. 


SOLUTION: Suppose T € L(V) is a positive operator and traceT = 0. 
There exists an operator S € L(V) such that T = S*S (by 7.27). Let 
(€1,-..,@n) be an orthonormal basis of V. Then 
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0 = trace T 
= (Te1,61) +-+- + (Ten, €n) 
= (S*Se,,e1) +--- + (S* Sen, en) 
= |[Ser||? +--+ + [[Senl]?. 


The equation above implies that Se; = 0 for each j. Because S is 0 on a 
basis of V, we have S = 0. Because T = S*S, this implies that T = 0. 


Suppose T € £(C%) is the operator whose matrix is 


51 —12 —21 
60 —40 -28 |. 
57 -68 1 


Someone tells you (accurately) that —48 and 24 are eigenvalues of T. With- 
out using a computer or writing anything down, find the third eigenvalue 
of T. 


SOLUTION: The sum of the eigenvalues of T equals the sum of the 
diagonal terms of the matrix above (both quantities equal traceT). The 
sum of the diagonal terms of the matrix above equals 12. The sum of two of 
the eigenvalues of T, —48 and 24, equals —24. Because the sum of all three 
eigenvalues of T must equal 12, the third eigenvalue of T must be 36. 


Prove or give a counterexample: if T € L(V) and c € F, then trace(cT) = 
ctraceT. 


SOLUTION: Suppose T € L(V) and c € F. To prove that trace(cT) = 
ctraceT’, consider a basis of V. Then trace T equals the sum of the diagonal 
terms of the matrix of T with respect to this basis. The matrix of cT, with 
respect to the same basis, equals c times the matrix of T. Thus the sum 
of the diagonal terms of the matrix of cT equals c times the sum of the 
diagonal terms of the matrix of T. In other words, trace(cT) = ctraceT. 


Prove or give a counterexample: if S,T € L(V), then 
trace(ST) = (trace S)(trace T). 
SOLUTION: Define S,T € L(F?) by S(x,y) = T(z,y) = (—y, 1). Then 


with respect to the standard bases the matrix of S (which of course equals 
the matrix of T) is 
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0 —1 
1 0 |’ 
Thus trace S = traceT = 0. However, ST = —I, so trace ST = —2. Thus 


for this choice of S and T, we have trace(ST) # (trace S) (trace T). 
Of course there are also many other examples. 


Suppose T € L(V). Prove that if trace(ST) = 0 for all S € L(V), then 
T=0. 


SOLUTION: Suppose that trace(ST) = 0 for all S € L(V). Then 
trace(T'S) = 0 for all S € L(V) (by 10.12). Suppose that there exists v € V 
such that Tv # 0. Then (Tw) can be extended to a basis (Tv, u1,..., Un) 
of V. Define S € L(V) by 


S(aTv + biui +--+ + bnun) = av. 


Thus S(Tv) = v and Su; = 0 for each j. Hence (TS)(Tv) = T(S(Tv)) = Tv 
and (T'S)(u;) = 0 for each j. This implies that with respect to the basis 
(Tv, ui,...,Un), the matrix of TS consists of all 0’s except for a 1 in the 
upper-left corner. Thus trace(TS) = 1. This contradiction shows that our 
assumption that Tv 4 0 must have been false. Thus Tw = 0 for every v E€ V, 
which means that T = 0. 


Suppose V is an inner-product space and T € L(V). Prove that if (e),..., €n) 
is an orthonormal basis of V, then 


trace(T*T) = [Tex ||? +--+ |Ten||. 


| Conclude that the right side of the equation above is independent of which 


orthonormal basis (€1,...,é€n) is chosen for V. 


SOLUTION: Suppose that (e1,...,€n) is an orthonormal basis of V. Then 


trace T*T = (T*Te,,e1) +--+ (T*T en, en) 
= (Te;,Te,) +--+ + (Ten, Ten) 
= |Tel? +--+ + {|Tenl|?. 
Because traceT*T does not depend upon the choice of a basis of V, 


the formula above shows that ||Tei||? +---+ ||Ten|l? is independent of the 
orthonormal basis (e1,..., €n). 
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Suppose V is a complex inner-product space and T € L(V). Let Ay,...,An 
be the eigenvalues of T, repeated according to multiplicity. Suppose 


Qll ++ Qin 


Qn ++. Ann 
is the matrix of T with respect to some orthonormal basis of V. Prove that 
n n 
Al? Heet lAn? < SOY lajt. 
k=] j=1 


SOLUTION: Suppose that (e1,..., €n) is the orthonormal basis with re- 
spect to which T has the matrix above. Thus for each k, we have 


Tek = @1,h€1 + +++ + anken, 
which implies that 
I[Texll? = jar? + +++ + [an,el?. 
Thus 
n n 
Teil? +--+ [|Tenll? = Y Jazz}. 
k=] j=1 


By the previous exercise, the left side of this equation equals trace(T*T). 
This reduces the exercise at hand to proving that 


[Ai]? +++++ [An]? < trace(T*T). 


There is an orthonormal basis (fı,... fn) with respect to which T has 
an upper-triangular matrix (by 6.28). The diagonal entries of the matrix of 
T with respect to (fi,..., fn) are precisely 41,...,An (by 8.10), where we 
can relabel the eigenvalues of T so that they appear in the order \j,... sÀn 
along the diagonal. In other words, 


` * 
M(T, (fi,---sfn)) = 


From the matrix above, we see that 
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IAk? < IT fell? 
for each k. Thus 


An? +--+ Anl? < ITAN? +--+ + T fali? 
= trace(T*T), 


as desired (here the last equality comes from ‘the previous exercise). 
Suppose V is an inner-product space. Prove that 

(S,T) = trace(ST*) 
defines an inner product on L(V). 


SOLUTION: Suppose that (-,-) is defined as above and R,S,T € L(V). 
Then (T,T) = trace(TT*), and thus (T, T} > 0 (by the formula given in 
Exercise 16 of this chapter, with T replaced with T*). Because TT* is a 
positive operator (see 7.27), we also see that (T, T} = 0 if and only if T = 0 
(by Exercise 11 of this chapter). 

Now 


(R+S,T) = trace((R + S)T*) 
= trace(RT* + ST*) 
= trace(RT*) + trace(ST*) 
= (R,T) + (S,T), 
where the third equality comes from 10.12. 
For c € F, we have 
(cS, T) = trace(cST*) 
= ctrace(ST*) 
=c(S,T), 


where the second equality comes from Exercise 13 of this chapter. 
Finally, 


(S,T) = trace(ST*) 
E] 
= trace(TS*) 
= (S,T}, 


133 


19. 


CHAPTER 10. Trace and Determinant 


where the second equality comes from Exercise 10 of this chapter. 
We have shown that (-,-) satisfies all the properties required-of an inner 
product. 


COMMENT: Suppose (é1,...,€n) is an orthonormal basis of V and 


Qll --- Qin 


Qn,1 +--+ Ann 
is the matrix of T with respect to this basis. Then 


(T,T) = trace(TT*) 
= trace(T“T) 
n n i 
= 5 >> la;kl?, 
k=1 j=1 


where the second equality comes from 10.12 and the third equality comes 


‘from Exercise 16 of this chapter. Thus the norm on L(V) induced by (-,-) 


is the same as the standard norm on F™ (here we are identifying each 
operator with its matrix, which has n? entries). Because norms determine 
the inner product (see Exercises 6 and 7 in Chapter 6), this means that the 
inner product (-,-) is the same as the standard inner product of F”? (again 
using the identification via matrices with respect to the orthonormal basis 
(e1,---,€n))- 


Suppose V is an inner-product space and T € L(V). Prove that if 


I[T*ul] < [ITI 


for every v € V, then T is normal. 


SOLUTION: Suppose that ||T*u|| < ||Tv]| for every v € V. Suppose 
u € V with ||ul| = 1. Extend (u) to an orthonormal basis (u, e1,...,€n) 
of V. Then 


trace(TT™) = ||T*ul|? + ||T*e1 |? +--- + lT“ enll? 
< |[Tull? + |Tel? +--+ + ITenll? 
= trace(T*T) 
= trace(TT*), 
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where the first and third lines come from Exercise 16 of this chapter and 
the last line comes from 10.12. Because the first and last lines above are 
equal, we must have equality throughout. Thus ||T*u|| = ||T'u||. This clearly 
implies that 


IT" (au)l| = I7(eu)l 


for every a E€ F. Because every vector in V can be written in the form au for 
some a € F and some u € V with ||u|| = 1, this implies that |/T*v|| = ||T || 
for every v € V. This implies that T is normal (by 7.6). 


COMMENT: This exercise fails on infinite-dimensional inner-product 
spaces, leading to what are called hyponormal operators, which have a well- 
developed theory. 


Prove or give a counterexample: if T € L(V) and c € F, then det(cT) = 
ctimV det T. 


SOLUTION: Let n = dim V. If A is an n-by-n matrix, then 
det(cA) = c" det A 


for every c € F (this follows immediately from the definition 10.25). Now 
suppose that T € L(V). Because det T equals the determinant of the matrix 
of T with respect to any basis (see 10.33), the equation above implies that 
for every c € F we have 


det(cT) = det M(cT) 
= det(cM(T)) 
= c” det M(T) 
= c" det T, 


as desired. 
Prove or give a counterexample: if S,T € L(V), then 
det(S + T) = det S + det T. 
SOLUTION: Define S, T € L(F?) by 
S(z,y) = (z,0) and T(z,y)= (0,y). 


Then S and T are both not invertible. Thus det S = det T = 0 (by 10.14). 
However, S +T = I and det I = 1, so det(S + T) # det S + det T. 
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Of course there are also many other examples. 
Suppose A is a block upper-triangular matrix 


Ay * 
A= a ’ 
0 An 


where each A; along the diagonal is a square matrix. Prove that 
det A = (det A1)...(det Am). 


SOLUTION: First consider the case where m = 2, so we can write A in 
the form 


where B is an n-by-n matrix 


bi, Bin 
B= : 
bn 1 . Ban 
and C is a p-by-p matrix 
Cll --- Clp 
C=) : 
Cpl +++ Cpp 


Let a; denote the entry in row j, column k of A. Note that ajk =Oifj >n 


and k < n (this follows from the block upper-triangular form of A). Because 
A is an (n + p)-by-(n + p) matrix, to compute det A we need to consider a 
typical permutation (mj,...,Mnsp) E perm(n + p). If any of m),..., mn is 
greater than n, then 


Gyles Omatp.ntp = 0. 


Thus in computing det A we need only consider permutations in which the 
first n coordinates all come from {1,...,n}, which means that the last p 
coordinates all come from {n + 1,...,n +p}. We can break any such per- 
mutation (m),...,Mnp+4p) into two pieces: a permutation of {1,...,n} and 
(relabeling n + 1,...,n +p as 1,...,p to correspond to the labeling of the 


136 CHAPTER 10. Trace and Determinant 


entries of C) a permutation of {1,...,p}. Clearly the sign of (mj,..., n+p) 
will equal the product of the signs of these two permutations. Putting all 
this together, we have 


det A = 3: (siga(mi,..., Mn+p))Om,,1 +++ Omagy tp 
(Mi ,...Mn+p)Eperm(n+p) 


= D [(sign(j1, -+ jn)) (sign(k1,..., kp)) 


(i1,--.jn)€permn 
(k1,....kp)€perm p 


bj,,1 se Djn nCky 1 ve Ckp.p] 
= SS Gign(fi.-..s5n))bia-+ Pian’ 
(j1,.-..Jn)€permn 
5  (sign(ki,..-, kp))cki1--- Ckp,p 
(k1,...,.kp)€perm p 


= (det B)(det C), 


completing the proof when m = 2. 
Suppose now that m > 2 and 


Aj * 
A= i 
0 Ai 
Writing 
Aj * 
B= i 
0 Am-1 
we have 
Bx 
AS | 0 Am | 
Thus 


det A = (det B)(det Am) 
= (det A;)... (det Am-1)(det Am), 
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where the first equality holds by the m = 2 case proved above and the 
second equality comes from induction on m (meaning that we can assume 
the desired result holds when m is replaced with m — 1). 


Suppose A is an n-by-n matrix with real entries. Let S € £(C") denote 
the operator on C” whose matrix equals A, and let T' € L(R”) denote the 
operator on R” whose matrix equals A. Prove that trace S = traceT and 
det S = det T. 


SOLUTION: The formulas defining the trace and determinant of a matrix 
do not depend upon whether we think of the matrix entries as real or complex 
numbers. We have trace A = traceS and trace A = traceT (by 10.11). 
Thus trace S = trace T. Similarly, we have det A = det S and det A = det T 
(by 10.33). Thus det S = det T. 


Suppose V is an inner-product space and T € L(V). Prove that 
det T* = det T. 


Use this to prove that |det T| = det VT*T, giving a different proof than was 
given in 10.37. 


SOLUTION: Let n=dimV. If A € F, then ((T — AZ)")* = (T* — AN)", 
which implies that 


dim null(T' — AJ)" = dim null(T* — AF)", 


where we have used Exercise 31 in Chapter 7. The equation above shows that 
the eigenvalues of T* are precisely the complex conjugates of the eigenvalues 
of T, with the same multiplicities. If F = C, the determinant equals the 
product of the eigenvalues, counting multiplicity, so we have det T* = det T 
(so far just on complex vector spaces). 

Now suppose F = R, so we must consider eigenpairs. If a, € R, then 


dim null(T? + aT + BI)" = dim null((T*)? + aT* + BI)", 


again by Exercise 31 in Chapter 7. Thus T and T* have the same eigenpairs 
with the same multiplicities. From the paragraph above, T and T* have 
the same eigenvalues with the same multiplicities. Because the determinant 
equals the product of the eigenvalues (counting multiplicity) times the prod- 
uct of the second coordinates of the eigenpairs (counting multiplicity), this 
implies that det T* = det T = det T. 
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At this point, we know that det T* = det T regardless of whether F = C 
or F = R. Thus 


(det VT*T)? = (det VT*T) (det /T*T) 
= det(T*T) 
= (det T*)(det T) 
= (det T)(det T) 
= |det TẸ. 


Taking square roots (and recalling that the positive operator VT*T has a 
nonnegative determinant), we have det VT*T = |det T|, as desired. 


Let a,b,c be positive numbers. Find the volume of the ellipsoid 


{(z,y,z) E R°: <1} 


Drita 
by finding a set 2 C R whose volume you know and an operator T € L(R?) 
such that T(Q) equals the ellipsoid above. 


SOLUTION: Let E denote the ellipsoid defined above and let 2 be the 
ball with radius 1 defined by 


0 = {(z,y,2z) € R3 : 1? ty? +27 <1}. 
Of course volume = fr. Define T € L(R?) by 
T(z, y,z) = (az, by, cz). 
If (x,y,z) € Q, then 
(aa)? | (by)? | (cz)? 


a2 b2 P 
<1, 


r Hy +27 


which shows that T (x,y,z) € E. Thus T(Q) c E. 
Conversely, if (£z, y, z) € E, then 
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which shows that (Z, #,2) € Q. Because T(2, ¥, 2) = (x,y,z), this implies 
that E C T(Q). 
The last two paragraphs show that E = T(Q). Thus 


volume E = volume T(Q) 
= |det T|(volumeQ) 
4rabe 
a oe 


where the second equality comes from 10.38. 


